Combinatorial libraries of proteins having the scaffold
structure of C-type lectin-like domains



FIELD OF THE INVENTION 

This invention relates to a system for generating randomised
libraries of ligand-binding protein units derived from
proteins containing the so-called C-type lectin-like domain
(CTLD), of which the carbohydrate recognition domain (CRD)
of C-type lectins is one example.

BACKGROUND OF THE INVENTION 

The C-type lectin-like domain (CTLD) is a protein domain
family which has been identified in a number of proteins
isolated from many animal species (reviewed in Drickamer
and Taylor (1993) and Drickamer (1999)). Initially, the
domain was identified as a domain common to the so-called
C-type lectins (calcium-dependent carbohydrate-binding
proteins) and named "Carbohydrate Recognition Domain"
("CRD"). More recently, it has become evident that this
domain is shared among many eukaryotic proteins, several of
which do not bind sugar moieties, and hence the canonical
domain has been named the CTLD.

CTLDs have been reported to bind a wide diversity of
compounds, including carbohydrates, lipids, proteins, and
even ice [Aspberg et al. (1997), Bettler et al. (1992),
Ewart et al. (1998), Graversen et al. (1998), Mizumo et al.
(1997), Sano et al. (1998), and Tormo et al. (1999)]. Only
one copy of the CTLD is present in some proteins, whereas
other proteins contain from two to multiple copies of the
domain. In the physiologically functional unit, multiplicity
in the number of CTLDs is often achieved by assembling
single-copy protein protomers into larger structures.

The CTLD consists of approximately 120 amino acid residues
and, characteristically, contains two or three intra-chain
disulfide bridges. Although the similarity at the amino
acid sequence level between CTLDs from different proteins
is relatively low, the 3D-structures of a number of CTLDs
have been found to be highly conserved, with the structural
variability essentially confined to a so-called loop-region,
often defined by up to five loops. Several CTLDs contain
either one or two binding sites for calcium, and most of the
side chains which interact with calcium are located in the
loop-region.

On the basis of CTLDs for which 3D structural information
is available, it has been inferred that the canonical CTLD
is structurally characterised by seven main
secondary-structure elements (i.e. five β-strands and two
α-helices) sequentially appearing in the order β1, α1, α2,
β2, β3, β4, and β5 (Fig. 1, and references given therein).
In all CTLDs for which 3D structures have been determined,
the β-strands are arranged in two anti-parallel β-sheets,
one composed of β1 and β5, the other composed of β2, β3 and
β4. An additional β-strand, β0, often precedes β1 in the
sequence and, where present, forms an additional strand
integrating with the β1, β5-sheet. Further, two disulfide
bridges, one connecting α1 and β5 (CI-CIV, Fig. 1) and one
connecting β3 and the polypeptide segment connecting β4 and
β5 (CII-CIII, Fig. 1), are invariantly found in all CTLDs
characterised so far. In the CTLD 3D-structure, these
conserved secondary structure elements form a compact
scaffold for a number of loops, which in the present context
collectively are referred to as the "loop-region",
protruding out from the core. These loops are in the primary
structure of the CTLDs organised in two segments, loop
segment A, LSA, and loop segment B, LSB. LSA represents the
long polypeptide segment connecting β2 and β3, which often
lacks regular secondary structure and contains up to four
loops. LSB represents the polypeptide segment connecting the
β-strands β3 and β4. Residues in LSA, together with single
residues in β4, have been shown to specify the Ca2+- and
ligand-binding sites of several CTLDs, including that of
tetranectin. For example, mutagenesis studies involving
substitution of single or a few residues have shown that
changes in binding specificity, Ca2+-sensitivity and/or
affinity can be accommodated by CTLDs [Weis and Drickamer
(1996), Chiba et al. (1999), Graversen et al. (2000)].

As noted above, overall sequence similarities between CTLDs
are often limited, as assessed e.g. by aligning a
prospective CTLD sequence with the group of
structure-characterized CTLDs presented in Fig. 1, using
sequence alignment procedures and analysis tools in common
use in the field of protein science. In such an alignment,
typically 22-30% of the residues of the prospective CTLD
will be identical with the corresponding residue in at least
one of the structure-characterized CTLDs. The sequence
alignment shown in Fig. 1 was strictly elucidated from
actual 3D structure data, so the fact that the polypeptide
segments of corresponding structural elements of the
framework also exhibit strong sequence similarities provides
a set of direct sequence-structure signatures, which can
readily be inferred from the sequence alignment.
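
The identity measure described above can be sketched in a
few lines of Python. This is only an illustration under
stated assumptions: the function name, the gap character and
the toy aligned fragments are hypothetical and are not taken
from Fig. 1; the alignment itself is assumed to have been
produced beforehand by any standard alignment tool.

```python
def identity_to_any(query, references, gap="-"):
    """Percent of query residues identical to the aligned residue
    in at least one reference sequence.

    All sequences must come from the same alignment (equal length).
    Alignment columns where the query has a gap are not counted.
    """
    if any(len(ref) != len(query) for ref in references):
        raise ValueError("aligned sequences must have equal length")
    counted = 0
    matched = 0
    for i, residue in enumerate(query):
        if residue == gap:
            continue
        counted += 1
        if any(ref[i] == residue for ref in references):
            matched += 1
    return 100.0 * matched / counted if counted else 0.0


# Toy aligned fragments (hypothetical, for illustration only).
refs = ["WIGL-TD", "WLGVNDE"]
print(identity_to_any("WMGLSD-", refs))
```

A prospective CTLD would then be compared against the 22-30%
range quoted in the text, using full-length aligned
sequences rather than short fragments.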

The implication is that CTLDs for which precise 3D
structural information is not yet available can nonetheless
be used as frameworks in the construction of new classes of
CTLD libraries. The specific additional steps involved in
preparing starting materials for the construction of such a
new class of CTLD library on the basis of a CTLD for which
no precise 3D structure is available would be the
following: (1) alignment of the sequence of the new CTLD
with the sequences shown in Fig. 1; and (2) assignment of
approximate locations of framework structural elements as
guided by the sequence alignment, observing any requirement
for minor adjustment of the alignment to ensure precise
alignment of the four canonical cysteine residues involved
in the formation of the two conserved disulfide bridges
(CI-CIV and CII-CIII, in Fig. 1). The main objective of
these steps would be to identify the sequence location of
the loop-region of the new CTLD, as flanked in the sequence
by segments corresponding to the β2-, β3- and β4-strands. To
provide further guidance in this, the results of an analysis
of the sequences of 29 bona fide CTLDs are given in Table 1
below in the form of typical tetrapeptide sequences, and
their consensus sequences, found as parts of CTLD β2- and
β3-strands, and the precise location of the β4-strand by
position and sequence characteristics as elucidated.

Of the 29 β2-strands,

   14 were found to conform to the consensus sequence
   WIGX (of which 12 were WIGL sequences, 1 was a WIGI
   sequence and 1 was a WIGV sequence);

   3 were found to conform to the consensus sequence WLGX
   (of which 1 was a WLGL sequence, 1 was a WLGV sequence
   and 1 was a WLGA sequence);

   3 were found to be WMGL sequences;

   3 were found to conform to the consensus sequence YLXM
   (of which 2 were YLSM sequences and 1 was a YLGM
   sequence);

   2 were found to conform to the consensus sequence WVGX
   (of which 1 was a WVGL sequence and 1 was a WVGA
   sequence); and

   the sequences of the remaining 4 β2-strands in the
   collection were FLGI, FVGL, FIGV and FLSM sequences,
   respectively.

Therefore, it is concluded that the four-residue β2
consensus sequence ("β2cseq") may be specified as follows:

   Residue 1: An aromatic residue, most preferably Trp,
              less preferably Phe and least preferably
              Tyr.

   Residue 2: An aliphatic or non-polar residue, most
              preferably Ile, less preferably Leu or Met
              and least preferably Val.

   Residue 3: An aliphatic or hydrophilic residue, most
              preferably Gly and least preferably Ser.

   Residue 4: An aliphatic or non-polar residue, most
              preferably Leu and less preferably Met, Val
              or Ile.

Accordingly the β2 consensus sequence may be summarized as
follows:

   β2cseq: (W, Y, F) - (I, L, V, M) - (G, S) - (L, M, V, I),

where the underlined residue (here, the first listed at each
position) denotes the most commonly found residue at that
sequence position.
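
As a minimal sketch of how the summarised β2 consensus could
be applied to a prospective sequence, the four positions can
be written as a regular expression. The function names and
the test sequence below are hypothetical. Note that two of
the tallied β2-strands (WLGA and WVGA) end in Ala, which is
not listed at position 4, so they fall outside the
summarised pattern; a hit or a miss is therefore a guide to
be checked against the alignment, not a decision rule.

```python
import re

# beta2cseq as summarised in the text:
# (W, Y, F) - (I, L, V, M) - (G, S) - (L, M, V, I)
BETA2_CSEQ = re.compile(r"[WYF][ILVM][GS][LMVI]")


def matches_beta2_cseq(tetrapeptide):
    """True if the four residues fit the summarised consensus."""
    return BETA2_CSEQ.fullmatch(tetrapeptide) is not None


def find_beta2_candidates(sequence):
    """Return (0-based start, tetrapeptide) for every matching
    four-residue window, overlapping windows included."""
    return [
        (i, sequence[i:i + 4])
        for i in range(len(sequence) - 3)
        if matches_beta2_cseq(sequence[i:i + 4])
    ]


# Hypothetical fragment, for illustration only.
print(find_beta2_candidates("AAWIGLTD"))
```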

All 29 β3-strands analysed are initiated with the CysII
residue canonical for all known CTLD sequences, and of the
29 β3-strands,

   5 were found to conform to the consensus sequence CVXI
   (of which 3 were CVEI sequences, 1 was a CVTI sequence
   and 1 was a CVQI sequence);

   4 were found to conform to the consensus sequence CVXM
   (of which 2 were CVEM sequences, 1 was a CWM sequence
   and 1 was a CVMM sequence);

   6 were found to conform to the consensus sequence CVXL
   (of which 2 were CWL sequences, 2 were CVSL sequences,
   1 was a CVHL sequence and 1 was a CVAL sequence);

   3 were found to conform to the consensus sequence CAXL
   (of which 2 were CAVL sequences and 1 was a CASL
   sequence);

   2 were found to conform to the consensus sequence CAXF
   (of which 1 was a CAHF sequence and 1 was a CAEF
   sequence);

   2 were found to conform to the consensus sequence CLXL
   (of which 1 was a CLEL sequence and 1 was a CLGL
   sequence); and

   the sequences of the remaining 7 β3-strands in the
   collection were CVYF, CVAQ, CAHV, CAM, CLEI, CIAY,
   and CMLL sequences, respectively.

Therefore, it is concluded that the four-residue β3
consensus sequence ("β3cseq") may be specified as follows:

   Residue 1: Cys, being the canonical CysII residue of
              CTLDs.

   Residue 2: An aliphatic or non-polar residue, most
              preferably Val, less preferably Ala or Leu
              and least preferably Ile or Met.

   Residue 3: Most commonly an aliphatic or charged
              residue, which most preferably is Glu.

   Residue 4: Most commonly an aliphatic, non-polar, or
              aromatic residue, most preferably Leu or
              Ile, less preferably Met or Phe and least
              preferably Tyr or Val.

Accordingly the β3 consensus sequence may be summarized as
follows:

   β3cseq: (C) - (V, A, L, I, M) - (E, X) - (L, I, M, F, Y, V),

where the underlined residue (here, the first listed at each
position) denotes the most commonly found residue at that
sequence position.

It is observed from the known 3D-structures of CTLDs (Fig.
1) that the β4-strands most often comprise five residues
located in the primary structure at positions -6 to -2
relative to the canonical CysIII residue of all known
CTLDs, and less often comprise four residues located at
positions -5 to -2 relative to the canonical CysIII
residue. The residue located at position -3, relative to
CysIII, is involved in co-ordination of the site-2 calcium
ion in CTLDs housing this site, and this notion is
reflected in the observation that, of the 29 CTLD sequences
analysed in Table 1, 27 have an Asp-residue or an
Asn-residue at this position, whereas 2 CTLDs have a Ser at
this position. From the known CTLD 3D-structures it is also
noted that the residue located at position -5, relative to
the CysIII residue, is involved in the formation of the
hydrophobic core of the CTLD scaffold. This notion is
reflected in the observation that, of the 29 CTLD sequences
analysed, 25 have a Trp-residue, 3 have a Leu-residue, and 1
an Ala-residue at this position. 18 of the 29 CTLD
sequences analysed have an Asn-residue at position -4.
Further, 19 of the 29 β4-strand segments are preceded by a
Gly residue.

Of the 29 central three-residue motifs located at positions
-5, -4 and -3 relative to the canonical CysIII residue in
the β4-strand:

   22 were of the sequence WXD (18 were WND, 2 were WKD,
   1 was WFD and 1 was WWD),

   2 were of the sequence WXN (1 was WW and 1 was WSN),

   and the remaining 5 motifs (WRS, LDD, LDN, LKS and
   ALD) were each represented once in the analysis.
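
The positional rules for the β4-strand can be illustrated
with a short Python sketch. It assumes that the index of the
canonical CysIII residue has already been assigned from the
alignment; the function names and the toy fragment are
hypothetical. Positions are counted as in the text, so
position -1 is the residue immediately preceding CysIII.

```python
def beta4_segment(sequence, cys3_index, length=5):
    """Return the beta4 candidate ending at position -2.

    length=5 covers positions -6..-2 (the more common case),
    length=4 covers positions -5..-2.
    cys3_index is the 0-based index of CysIII in `sequence`.
    """
    if length not in (4, 5):
        raise ValueError("length must be 4 or 5")
    if sequence[cys3_index] != "C":
        raise ValueError("cys3_index does not point at a cysteine")
    start = cys3_index - (length + 1)
    if start < 0:
        raise ValueError("sequence too short upstream of CysIII")
    return sequence[start:cys3_index - 1]


def describe_core_motif(sequence, cys3_index):
    """Report the residues at positions -5, -4 and -3 and whether
    they follow the most common pattern reported in the text."""
    if cys3_index < 5:
        raise ValueError("sequence too short upstream of CysIII")
    motif = sequence[cys3_index - 5:cys3_index - 2]
    return {
        "motif": motif,
        "trp_at_-5": motif[0] == "W",             # 25 of 29 sequences
        "asp_or_asn_at_-3": motif[2] in ("D", "N"),  # 27 of 29 sequences
    }


# Hypothetical fragment with the cysteine at index 7.
fragment = "GKWNDVPC"
print(beta4_segment(fragment, 7))
print(describe_core_motif(fragment, 7))
```

Motifs such as WRS or LKS would be flagged as atypical by
this check even though they occur in the analysed set, so
the flags are descriptive only.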

It has now been found that each member of the family of
CTLDs represents an attractive opportunity for the
construction of new protein libraries from which members
with affinity for new ligand targets can be identified and
isolated using screening or selection methods. Such
libraries may be constructed from a CTLD framework
structure in which the CTLD's loop-region is partially or
completely replaced with one or more randomised polypeptide
segments.

One such system, where the protein used as scaffold is
tetranectin or the CTLD of tetranectin, is envisaged as a
system of particular interest, not least because the
stability of the trimeric complex of tetranectin protomers
is very high (International Patent Application Publication
No. WO 98/56906 A2).

Tetranectin is a trimeric glycoprotein [Holtet et al.
(1997), Nielsen et al. (1997)], which has been isolated
from human plasma and found to be present in the
extracellular matrix in certain tissues. Tetranectin is
known to bind calcium, complex polysaccharides, plasminogen,
fibrinogen/fibrin, and apolipoprotein(a). The interaction
with plasminogen and apolipoprotein(a) is mediated by the
so-called kringle 4 protein domain therein. This
interaction is known to be sensitive to calcium and to
derivatives of the amino acid lysine [Graversen et al.
(1998)].

A human tetranectin gene has been characterised, and both
human and murine tetranectin cDNA clones have been
isolated. Both the human and the murine mature protein
comprise 181 amino acid residues (Fig. 2). The 3D-structures
of full length recombinant human tetranectin and of the
isolated tetranectin CTLD have been determined
independently in two separate studies [Nielsen et al. (1997)
and Kastrup et al. (1998)]. Tetranectin is a two- or
possibly three-domain protein, i.e. the main part of the
polypeptide chain comprises the CTLD (amino acid residues
Gly53 to Val181), whereas the region Leu26 to Lys52 forms an
alpha-helix governing trimerisation of the protein via the
formation of a homotrimeric parallel coiled coil. The
polypeptide segment Glu1 to Glu25 contains the binding site
for complex polysaccharides (Lys6 to Lys15) [Lorentsen et
al. (2000)] and appears to contribute to stabilisation of
the trimeric structure [Holtet et al. (1997)]. The two amino
acid residues Lys148 and Glu150, localised in loop 4, and
Asp165 (localised in β4) have been shown to be of critical
importance for plasminogen kringle 4 binding, whereas the
residues Ile140 (in loop 3) and Lys166 and Arg167 (in β4)
have been shown to be of some importance [Graversen et al.
(1998)]. Substitution of Thr149 (in loop 4) with an
aromatic residue has been shown to significantly increase
affinity of tetranectin for kringle 4 and to increase
affinity for plasminogen kringle 2 to a level comparable to
the affinity of wild type tetranectin for kringle 4
[Graversen et al. (2000)].

OBJECT OF THE INVENTION

The object of the invention is to provide a new practicable
method for the generation of useful protein products
endowed with binding sites able to bind substances of
interest with high affinity and specificity.

The invention describes one way in which such new and
useful protein products may advantageously be obtained by
applying standard combinatorial protein chemistry methods,
commonly used in the recombinant antibody field, to
generate randomised combinatorial libraries of protein
modules, in which each member contains an essentially common
core structure similar to that of a CTLD.

The variation of binding site configuration among naturally
occurring CTLDs shows that their common core structure can
accommodate many essentially different configurations of
the ligand binding site. CTLDs are therefore particularly
well suited to serve as a basis for constructing such new
and useful protein products with desired binding
properties.

In terms of practical application, the new artificial CTLD
protein products can be employed in applications in which
antibody products are presently used as key reagents in
technical biochemical assay systems or medical in vitro or
in vivo diagnostic assay systems or as active components in
therapeutic compositions.

In terms of use as components of in vitro assay systems,
the artificial CTLD protein products are preferable to
antibody derivatives as each binding site in the new protein
product is harboured in a single structurally autonomous
protein domain. CTLDs are resistant to proteolysis, and
neither stability nor access to the ligand-binding site is
compromised by the attachment of other protein domains to
the N- or C-terminus of the CTLD. Accordingly, the CTLD
binding module may readily be utilized as a building block
for the construction of modular molecular assemblies, e.g.
harbouring multiple CTLDs of identical or nonidentical
specificity in addition to appropriate reporter modules
like peroxidases, phosphatases or any other
signal-mediating moiety.

In terms of in vivo use as an essential component of
compositions to be used for in vivo diagnostic or
therapeutic purposes, artificial CTLD protein products
constructed on the basis of human CTLDs are virtually
identical to the corresponding natural CTLD protein already
present in the body, and are therefore expected to elicit
minimal immunological response in the patient. Single CTLDs
are about half the mass of the smallest functional antibody
derivative, the single-chain Fv derivative, and this small
size may in some applications be advantageous as it may
provide better tissue penetration and distribution, as well
as a shorter half-life in circulation. Multivalent formats
of CTLD proteins, e.g. corresponding to the complete
tetranectin trimer or the further multimerized collectins,
such as mannose binding protein, provide increased binding
capacity and avidity and longer circulation half-life.

One particular advantage of the preferred embodiment of the
invention arises from the fact that mammalian tetranectins,
as exemplified by murine and human tetranectin, are of
essentially identical structure. This conservation among
species is of great practical importance as it allows
straightforward swapping of polypeptide segments defining
ligand-binding specificity between e.g. murine and human
tetranectin derivatives. The option of facile swapping of
species genetic background between tetranectin derivatives
is in marked contrast to the well-known complications of
effecting the "humanisation" of murine antibody
derivatives.

Further advantages of the invention are:

The availability of a general and simple procedure for
reliable conversion of an initially selected protein
derivative into a final protein product, which without
further reformatting may be produced in bacteria (e.g.
Escherichia coli) both in small and in large scale
(International Patent Application Publication No. WO
94/18227 A2).

The option of including several identical or non-identical
binding sites in the same functional protein unit by simple
and general means, thereby enabling the exploitation even
of weak affinities by means of avidity in the interaction,
or the construction of bi- or heterofunctional molecular
assemblies (International Patent Application Publication
No. WO 98/56906 A2).

The possibility of modulating binding by addition or
removal of divalent metal ions (e.g. calcium ions) in
combinatorial libraries with one or more preserved metal
binding site(s) in the CTLDs.

SUMMARY OF THE INVENTION

The present invention provides a great number of novel and
useful proteins, each being a protein having the scaffold
structure of C-type lectin-like domains (CTLD), said
protein comprising a variant of a model CTLD wherein the
α-helices and β-strands and connecting segments are
conserved to such a degree that the scaffold structure of
the CTLD is substantially maintained, while the loop region
is altered by amino acid substitution, deletion, insertion
or any combination thereof, with the proviso that said
protein is not any of the known CTLD loop derivatives of
C-type lectin-like proteins or C-type lectins listed in the
following Table 2.

Normally the model CTLD is defined by having a 3D structure
that conforms to the secondary-structure arrangement
illustrated in Fig. 1, characterized by the following main
secondary structure elements:

   five β-strands and two α-helices sequentially appearing
   in the order β1, α1, α2, β2, β3, β4, and β5, the
   β-strands being arranged in two anti-parallel β-sheets,
   one composed of β1 and β5, the other composed of β2, β3
   and β4,

   at least two disulfide bridges, one connecting α1 and
   β5 and one connecting β3 and the polypeptide segment
   connecting β4 and β5,

   a loop region consisting of two polypeptide segments,
   loop segment A (LSA) connecting β2 and β3 and
   comprising typically 15-70 or, less typically, 5-14
   amino acid residues, and loop segment B (LSB) connecting
   β3 and β4 and comprising typically 5-12 or, less
   typically, 2-4 amino acid residues.
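
The residue-count ranges given for LSA and LSB lend
themselves to a simple lookup. The following Python sketch
only restates those ranges; the dictionary layout, the
function name and the label for lengths outside the stated
ranges are illustrative assumptions.

```python
# Residue-count ranges as stated in the text (inclusive bounds).
LOOP_SEGMENT_RANGES = {
    "LSA": {"typical": (15, 70), "less typical": (5, 14)},
    "LSB": {"typical": (5, 12), "less typical": (2, 4)},
}


def classify_loop_segment(kind, n_residues):
    """Classify a loop segment length for kind 'LSA' or 'LSB'."""
    for label, (low, high) in LOOP_SEGMENT_RANGES[kind].items():
        if low <= n_residues <= high:
            return label
    return "outside stated ranges"


print(classify_loop_segment("LSA", 30))
print(classify_loop_segment("LSB", 3))
```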

However, a CTLD for which no precise 3D structure is
available can also be used as a model CTLD, such a CTLD
being defined by showing sequence similarity to a
previously recognised member of the CTLD family as expressed
by an amino acid sequence identity of at least 22 %,
preferably at least 25 % and more preferably at least 30 %,
and by containing the cysteine residues necessary for
establishing the conserved two-disulfide bridge topology
(i.e. CysI, CysII, CysIII and CysIV). The loop region,
consisting of the loop segments LSA and LSB, and its
flanking β-strand structural elements can then be identified
by inspection of the sequence alignment with the collection
of CTLDs shown in Fig. 1, which provides identification of
the sequence locations of the β2- and β3-strands with the
further corroboration provided by comparison of these
sequences with the four-residue consensus sequences, β2cseq
and β3cseq, and of the β4 strand segment, located typically
at positions -6 to -2 and less typically at positions -5 to
-2 relative to the conserved CysIII residue, with the
characteristic residues at positions -5 and -3 as elucidated
from Table 1 and deduced above under BACKGROUND OF THE
INVENTION.

The same considerations apply for determining whether in a
model CTLD the α-helices and β-strands and connecting
segments are conserved to such a degree that the scaffold
structure of the CTLD is substantially maintained.

It may be desirable that up to 10, preferably up to 4, and
more preferably 1 or 2, amino acid residues are
substituted, deleted or inserted in the α-helices and/or
β-strands and/or connecting segments of the model CTLD. In
particular, changes of up to 4 residues may be made in the
β-strands of the model CTLD as a consequence of the
introduction of recognition sites for one or more
restriction endonucleases in the nucleotide sequence
encoding the CTLD to facilitate the excision of part or all
of the loop region and the insertion of an altered amino
acid sequence instead, while the scaffold structure of the
CTLD is substantially maintained.

Of particular interest are proteins wherein the model CTLD
is that of a tetranectin. Well known tetranectins, the
CTLDs of which can be used as model CTLDs, are human
tetranectin and murine tetranectin. The proteins according
to the invention thus comprise variants of such model
CTLDs.

The proteins according to the invention may comprise
N-terminal and/or C-terminal extensions of the CTLD
variant, and such extensions may for example contain
effector, enzyme, further binding and/or multimerising
functions. In particular, said extension may be the
non-CTLD-portions of a native C-type lectin-like protein or
C-type lectin or a "soluble" variant thereof lacking a
functional transmembrane domain.

The proteins according to the invention may also be mul- 
timers of a moiety comprising the CTLD variant, e.g. de- 
rivatives of the native tetranectin trimer. 

In a preferred aspect the present invention provides a
combinatorial library of proteins having the scaffold
structure of C-type lectin-like domains (CTLD), said
proteins comprising variants of a model CTLD wherein the
α-helices and β-strands are conserved to such a degree that
the scaffold structure of the CTLD is substantially
maintained, while the loop region or parts of the loop
region of the CTLD is randomised with respect to amino acid
sequence and/or number of amino acid residues.

The proteins making up such a library comprise variants of
model CTLDs defined as for the above proteins according to
the invention, and the variants may include the changes
stated for those proteins.

In particular, the combinatorial library according to the
invention may consist of proteins wherein the model CTLD is
that of a tetranectin, e.g. that of human tetranectin or
that of murine tetranectin.

The combinatorial library according to the invention may
consist of proteins comprising N-terminal and/or C-terminal
extensions of the CTLD variant, and such extensions may for
example contain effector, enzyme, further binding and/or
multimerising functions. In particular, said extensions may
be the non-CTLD-portions of a native C-type lectin-like
protein or C-type lectin or a "soluble" variant thereof
lacking a functional transmembrane domain.

The combinatorial library according to the invention may
also consist of proteins that are multimers of a moiety
comprising the CTLD variant, e.g. derivatives of the native
tetranectin trimer.

The present invention also provides derivatives of a native
tetranectin wherein up to 10, preferably up to 4, and more
preferably 1 or 2, amino acid residues are substituted,
deleted or inserted in the α-helices and/or β-strands and/or
connecting segments of its CTLD, as well as nucleic acids
encoding such derivatives. Specific derivatives appear from
SEQ ID Nos: 02, 04, 09, 11, 13, 15, 29, 31, 36, and 38, and
nucleic acids comprising nucleotide inserts encoding
specific tetranectin derivatives appear from SEQ ID Nos:
12, 14, 35, and 37.

The invention comprises a method of constructing a
tetranectin derivative adapted for the preparation of a
combinatorial library according to the invention, wherein
the nucleic acid encoding the tetranectin derivative has
been modified to generate endonuclease restriction sites
within nucleic acid segments encoding β2, β3 or β4, or up
to 30 nucleotides upstream or downstream in the sequence
from any nucleotide which belongs to a nucleic acid segment
encoding β2, β3 or β4.

The invention also comprises the use of a nucleotide
sequence encoding a tetranectin, or a derivative thereof
wherein the scaffold structure of its CTLD is substantially
maintained, for preparing a library of nucleotide sequences
encoding related proteins by randomising part or all of the
nucleic acid sequence encoding the loop region of its CTLD.

Further, the present invention provides nucleic acid
comprising any nucleotide sequence encoding a protein
according to the invention.

In particular, the invention provides a library of nucleic
acids encoding proteins of a combinatorial library
according to the invention, in which the members of the
ensemble of nucleic acids that collectively constitute said
library of nucleic acids are able to be expressed in a
display system, which provides for a logical, physical or
chemical link between entities displaying phenotypes
representing properties of the displayed expression
products and their corresponding genotypes.

In such a library the display system may be selected from

(I)   a phage display system such as
      (1) a filamentous phage fd in which the library of
          nucleic acids is inserted into
          (a) a phagemid vector,
          (b) the viral genome of a phage,
          (c) purified viral nucleic acid in purified
              single- or double-stranded form, or
      (2) a phage lambda in which the library is inserted
          into
          (a) purified phage lambda DNA, or
          (b) the nucleic acid in lambda phage particles; or

(II)  a viral display system in which the library of
      nucleic acids is inserted into the viral nucleic acid
      of a eukaryotic virus such as baculovirus; or

(III) a cell-based display system in which the library of
      nucleic acids is inserted into, or adjoined to, a
      nucleic acid carrier able to integrate either into
      the host genome or into an extrachromosomal element
      able to maintain and express itself within the cell
      and suitable for cell-surface display on the surface
      of
      (a) bacterial cells,
      (b) yeast cells, or
      (c) mammalian cells; or

(IV)  a nucleic acid entity suitable for ribosome linked
      display into which the library of nucleic acids is
      inserted; or

(V)   a plasmid suitable for plasmid linked display into
      which the library of nucleic acids is inserted.

A well-known and useful display system is the "Recombinant
Phage Antibody System" with the phagemid vector "pCANTAB
5E" supplied by Amersham Pharmacia Biotech (code no.
27-9401-01).

Further, the present invention provides a method of
preparing a protein according to the invention, wherein the
protein comprises one or more, identical or non-identical,
CTLDs with novel loop-region sequences which has (have)
been isolated from one or more CTLD libraries by screening
or selection. At least one such CTLD may have been further
modified by mutagenesis; and the protein containing at
least one CTLD may have been assembled from two or more
components by chemical or enzymatic coupling or
crosslinking.

Also, the present invention provides a method of prepar- 
ing a combinatorial library according to the invention 
comprising the following steps: 

1) inserting nucleic acid encoding a protein comprising a
   model CTLD into a suitable vector,

2) if necessary, introducing restriction endonuclease
   recognition sites by site directed mutagenesis, said
   recognition sites being properly located in the
   sequence at or close to the ends of the sequence
   encoding the loop region of the CTLD or part thereof,

3) excising the DNA fragment encoding the loop region or
   part thereof by use of the proper restriction
   endonucleases,

4) ligating mixtures of DNA fragments into the restricted
   vector, and

5) inducing the vector to express randomised proteins
   having the scaffold structure of CTLDs in a suitable
   medium.
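
The net effect of steps 3) and 4) at the protein level can
be mimicked in silico. The sketch below is a simplified
model under stated assumptions, not the laboratory
procedure: it works on amino acid strings rather than DNA,
draws residues uniformly from the 20 standard amino acids
(a real library's composition depends on the codon scheme
of the ligated fragments, which is not modelled here), and
uses a hypothetical toy scaffold and hypothetical function
names.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def randomise_segment(scaffold, start, end, lengths, rng):
    """Replace scaffold[start:end] with a random segment whose
    length is drawn from `lengths`; the framework is untouched."""
    n = rng.choice(lengths)
    segment = "".join(rng.choice(AMINO_ACIDS) for _ in range(n))
    return scaffold[:start] + segment + scaffold[end:]


def build_library(scaffold, start, end, lengths, size, seed=0):
    """Return `size` variants; a fixed seed makes runs repeatable."""
    rng = random.Random(seed)
    return [
        randomise_segment(scaffold, start, end, lengths, rng)
        for _ in range(size)
    ]


# Hypothetical scaffold: framework residues flank a 6-residue loop.
toy_scaffold = "AAAWIGL" + "XXXXXX" + "CVEIAAA"
for variant in build_library(toy_scaffold, 7, 13, [5, 6, 7], size=3):
    print(variant)
```

Varying `lengths` corresponds to randomising the number of
residues as well as the sequence of the loop region.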

In a further aspect, the present invention provides a
method of screening a combinatorial library according to
the invention for binding to a specific target which
comprises the following steps:



1) 



expressing a nucleic acids library according to any 
one of claims 59-61 to display the library of pro- 
teins in the display system; 



2) 



contacting the collection of entities displayed 
with a suitably tagged target substance for which 



25 



isolation of a CTLD-derived exhibiting affinity for 
said target substance is desired; 



3) 



harvesting subpopulations of the entities displayed 
that exhibit affinity for said target substance by 
means of affinity-based selective extractions, 



30 



utilizing the tag to which said target substance is 
conjugated or physically attached or adhering to as 
a vehicle or means of affinity purification, a pro- 
cedure commonly referred to in the field as "affin- 
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ity panning", followed by re-amplification of the 
sub-library; 

4) isolating progressively better binders by repeated 
rounds of panning and re-amplification until a 

5 suitably small number of good candidate binders is 

obtained; and, 

5) if desired, isolating each of the good candidates 
as an individual clone and subjecting it to ordi- 
nary functional and structural characterisation in 

10 preparation for final selection of one or more pre- 

ferred product clones. 
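
The effect of repeated rounds of panning and re-amplification in steps 3) and 4) can be illustrated with a simple numerical sketch. The model below is not part of the disclosed method: the starting binder fraction and the two recovery probabilities are hypothetical values chosen only for illustration, and re-amplification is assumed to preserve the composition of the harvested sub-library.

```python
def enrich(binder_fraction, binder_recovery, background_recovery):
    """Return the binder fraction after one round of panning.

    All arguments are hypothetical illustration parameters:
    binder_fraction      fraction of displayed entities that bind the target
    binder_recovery      probability that a binder is harvested
    background_recovery  probability that a non-binder is harvested
    """
    kept_binders = binder_fraction * binder_recovery
    kept_background = (1.0 - binder_fraction) * background_recovery
    return kept_binders / (kept_binders + kept_background)


def rounds_needed(start_fraction, binder_recovery, background_recovery,
                  target_fraction, max_rounds=20):
    """Count panning rounds until binders reach target_fraction.

    Returns (round_number, fraction); round_number is None if the
    target is not reached within max_rounds.
    """
    fraction = start_fraction
    for round_number in range(1, max_rounds + 1):
        fraction = enrich(fraction, binder_recovery, background_recovery)
        if fraction >= target_fraction:
            return round_number, fraction
    return None, fraction


if __name__ == "__main__":
    # One binder per million clones, 1000-fold preference per round.
    rounds, fraction = rounds_needed(1e-6, 0.1, 1e-4, 0.9)
    print(rounds, round(fraction, 4))
```

Under these assumed numbers each round multiplies the binder-to-background ratio by a constant factor, which is why a small number of rounds can reduce a very large library to a few candidate clones.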

In a still further aspect, the present invention provides
a method of reformatting a protein according to the
invention or selected from a combinatorial library
according to the invention and containing a CTLD variant
exhibiting desired binding properties, in a desired
alternative species-compatible framework by excising the
nucleic acid fragment encoding the loop region-substituting
polypeptide and any required single framework mutations
from the nucleic acid encoding said protein using PCR
technology, site directed mutagenesis or restriction enzyme
digestion and inserting said nucleic acid fragment into the
appropriate location(s) in a display- or protein expression
vector that harbours a nucleic acid sequence encoding the
desired alternative CTLD framework.

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows an alignment of the amino acid sequences of
ten CTLDs of known 3D-structure. The sequence locations
of main secondary structure elements are indicated above
each sequence, labelled in sequential numerical order as
"αN", denoting α-helix number N, and "βM", denoting
β-strand number M.

The four cysteine residues involved in the formation of
the two conserved disulfide bridges of CTLDs are indicated
and enumerated in the Figure as "CI", "CII", "CIII" and
"CIV", respectively. The two conserved disulfide bridges
are CI-CIV and CII-CIII, respectively.



The ten C-type lectins are

hTN: human tetranectin [Nielsen et al. (1997)];

MBP: mannose binding protein [Weis et al. (1991); Sheriff
et al. (1994)];

SP-D: surfactant protein D [Hakansson et al. (1999)];

LY49A: NK receptor LY49A [Tormo et al. (1999)];

H1-ASR: H1 subunit of the asialoglycoprotein receptor
[Meier et al. (2000)];

MMR-4: macrophage mannose receptor domain 4 [Feinberg et
al. (2000)];

IX-A and IX-B: coagulation factors IX/X-binding protein
domain A and B, respectively [Mizuno et al. (1997)];

Lit: lithostathine [Bertrand et al. (1996)];

TU14: tunicate C-type lectin [Poget et al. (1999)].

Fig. 2 shows an alignment of the nucleotide and amino
acid sequences of the coding regions of the mature forms
of human and murine tetranectin with an indication of
known secondary structural elements.

hTN: human tetranectin; nucleotide sequence from Berglund
and Petersen (1992).

mTN: murine tetranectin; nucleotide sequence from Sørensen
et al. (1995).

Secondary structure elements from Nielsen et al. (1997).
"α" denotes an α-helix; "β" denotes a β-strand; and "L"
denotes a loop.

Fig. 3 shows an alignment of the nucleotide and amino
acid sequences of human and murine tlec coding regions.
htlec: the sequence derived from hTN; mtlec: the sequence
derived from mTN. The positions of the restriction
endonuclease sites for Bgl II, Kpn I, and Mun I are indicated.

Fig. 4 shows an alignment of the nucleotide and amino
acid sequences of human and murine tCTLD coding regions.
htCTLD: the sequence derived from hTN; mtCTLD: the sequence
derived from mTN. The positions of the restriction
endonuclease sites for Bgl II, Kpn I, and Mun I are indicated.

Fig. 5 shows an outline of the pT7H6FX-htlec expression
plasmid. The FX-htlec fragment was inserted into pT7H6
[Christensen et al. (1991)] between the Bam HI and Hind
III cloning sites.

Fig. 6 shows the amino acid sequence (one letter code) of 
the FX-htlec part of the H6FX-htlec fusion protein pro- 
duced by pT7H6FX-htlec. 

Fig. 7 shows an outline of the pT7H6FX-htCTLD expression
plasmid. The FX-htCTLD fragment was inserted into pT7H6
[Christensen et al. (1991)] between the Bam HI and Hind
III cloning sites.

Fig. 8 shows the amino acid sequence (one letter code) of
the FX-htCTLD part of the H6FX-htCTLD fusion protein
produced by pT7H6FX-htCTLD.

Fig. 9 shows an outline of the pPhTN phagemid. The PhTN
fragment was inserted into the phagemid pCANTAB 5E
(Amersham Pharmacia Biotech, code no. 27-9401-01) between the
Sfi I and Not I restriction sites.

Fig. 10 shows the amino acid sequence (one letter code) 
of the PhTN part of the PhTN-gene III fusion protein pro- 
duced by pPhTN. 

Fig. 11 shows an outline of the pPhTN3 phagemid. The 
PhTN3 fragment was inserted into the phagemid pCANTAB 5E 
(Amersham Pharmacia Biotech, code no. 27-9401-01) between 
the Sfi I and Not I restriction sites. 

Fig. 12 shows the amino acid sequence (one letter code) 
of the PhTN3 part of the PhTN3-gene III fusion protein 
produced by pPhTN3. 

Fig. 13 shows an outline of the pPhtlec phagemid. The
Phtlec fragment was inserted into the phagemid pCANTAB 5E
(Amersham Pharmacia Biotech, code no. 27-9401-01) between
the Sfi I and Not I restriction sites.

Fig. 14 shows the amino acid sequence (one letter code)
of the Phtlec part of the Phtlec-gene III fusion protein
produced by pPhtlec.

Fig. 15 shows an outline of the pPhtCTLD phagemid. The
PhtCTLD fragment was inserted into the phagemid pCANTAB
5E (Amersham Pharmacia Biotech, code no. 27-9401-01)
between the Sfi I and Not I restriction sites.

Fig. 16 shows the amino acid sequence (one letter code) 
of the PhtCTLD part of the PhtCTLD-gene III fusion pro- 
tein produced by pPhtCTLD. 

Fig. 17 shows an outline of the pUC-mtlec.

Fig. 18 shows an outline of the pT7H6FX-mtlec expression
plasmid. The FX-mtlec fragment was inserted into pT7H6
[Christensen et al. (1991)] between the Bam HI and Hind
III cloning sites.

Fig. 19 shows the amino acid sequence (one letter code)
of the FX-mtlec part of the H6FX-mtlec fusion protein
produced by pT7H6FX-mtlec.

Fig. 20 shows an outline of the pT7H6FX-mtCTLD expression
plasmid. The FX-mtCTLD fragment was inserted into pT7H6
[Christensen et al. (1991)] between the Bam HI and Hind
III cloning sites.

Fig. 21 shows the amino acid sequence (one letter code) 
of the FX-mtCTLD part of the H6FX-mtCTLD fusion protein 
produced by pT7H6FX-mtCTLD. 

Fig. 22 shows an outline of the pPmtlec phagemid. The
Pmtlec fragment was inserted into the phagemid pCANTAB 5E
(Amersham Pharmacia Biotech, code no. 27-9401-01) between
the Sfi I and Not I restriction sites.

Fig. 23 shows the amino acid sequence (one letter code)
of the Pmtlec part of the Pmtlec-gene III fusion protein
produced by pPmtlec.

Fig. 24 shows an outline of the pPmtCTLD phagemid. The
PmtCTLD fragment was inserted into the phagemid pCANTAB
5E (Amersham Pharmacia Biotech, code no. 27-9401-01)
between the Sfi I and Not I restriction sites.

Fig. 25 shows the amino acid sequence (one letter code)
of the PmtCTLD part of the PmtCTLD-gene III fusion
protein produced by pPmtCTLD.

Fig. 26 shows an ELISA-type analysis of Phtlec-, PhTN3-,
and M13K07 helper phage binding to anti-tetranectin or
BSA. Panel A: Analysis with 3% skimmed milk/5 mM EDTA as
blocking reagent. Panel B: Analysis with 3% skimmed milk
as blocking reagent.

Fig. 27 shows an ELISA-type analysis of Phtlec-, PhTN3-,
and M13K07 helper phage binding to plasminogen (Plg) and
BSA. Panel A: Analysis with 3% skimmed milk/5 mM EDTA as
blocking reagent. Panel B: Analysis with 3% skimmed milk
as blocking reagent.

Fig. 28 shows an ELISA-type analysis of the B series and
C series polyclonal populations, from selection round 2,
binding to plasminogen (Plg) compared to background.

Fig. 29. Phages from twelve clones isolated from the third
round of selection analysed for binding to hen egg white
lysozyme, human β2-microglobulin and background in an
ELISA-type assay.

Fig. 30 shows the amino acid sequence (one letter code)
of the PrMBP part of the PrMBP-gene III fusion protein
produced by pPrMBP.

Fig. 31 shows an outline of the pPrMBP phagemid. The 
PrMBP fragment was inserted into the phagemid pCANTAB 5E 
(Amersham Pharmacia Biotech, code no. 27-9401-01) between 
the Sfi I and Not I restriction sites. 

Fig. 32 shows the amino acid sequence (one letter code)
of the PhSP-D part of the PhSP-D-gene III fusion protein
produced by pPhSP-D.



Fig. 33 shows an outline of the pPhSP-D phagemid. The
PhSP-D fragment was inserted into the phagemid pCANTAB 5E
(Amersham Pharmacia Biotech, code no. 27-9401-01) between
the Sfi I and Not I restriction sites.

Fig. 34. Phages from 48 clones isolated from the third
round of selection in the #1 series analysed for binding
to hen egg white lysozyme and to A-HA in an ELISA-type
assay.

Fig. 35. Phages from 48 clones isolated from the third
round of selection in the #4 series analysed for binding
to hen egg white lysozyme and to A-HA in an ELISA-type
assay.

DETAILED DESCRIPTION OF THE INVENTION 
I. Definitions 

The terms "C-type lectin-like protein" and "C-type
lectin" are used to refer to any protein present in, or
encoded in the genomes of, any eukaryotic species, which
protein contains one or more CTLDs or one or more domains
belonging to a subgroup of CTLDs, the CRDs, which bind
carbohydrate ligands. The definition specifically includes
membrane attached C-type lectin-like proteins and
C-type lectins, "soluble" C-type lectin-like proteins and
C-type lectins lacking a functional transmembrane domain
and variant C-type lectin-like proteins and C-type
lectins in which one or more amino acid residues have
been altered in vivo by glycosylation or any other
post-synthetic modification, as well as any product that is
obtained by chemical modification of C-type lectin-like
proteins and C-type lectins.

In the claims and throughout the specification certain
alterations may be defined with reference to amino acid
residue numbers of a CTLD domain or a CTLD-containing
protein. The amino acid numbering starts at the first
N-terminal amino acid of the CTLD or the native or
artificial CTLD-containing protein product, as the case may
be, which shall in each case be indicated by unambiguous
external literature reference or internal reference to a
figure contained herein within the textual context.

The terms "amino acid", "amino acids" and "amino acid
residues" refer to all naturally occurring L-α-amino
acids. This definition is meant to include norleucine,
ornithine, and homocysteine. The amino acids are identified
by either the single-letter or three-letter designations:



Asp  D  aspartic acid     Ile  I  isoleucine
Thr  T  threonine         Leu  L  leucine
Ser  S  serine            Tyr  Y  tyrosine
Glu  E  glutamic acid     Phe  F  phenylalanine
Pro  P  proline           His  H  histidine
Gly  G  glycine           Lys  K  lysine
Ala  A  alanine           Arg  R  arginine
Cys  C  cysteine          Trp  W  tryptophan
Val  V  valine            Gln  Q  glutamine
Met  M  methionine        Asn  N  asparagine
Nle  J  norleucine        Orn  O  ornithine
Hcy  U  homocysteine      Xxx  X  any L-α-amino acid



The naturally occurring L-α-amino acids may be classified
according to the chemical composition and properties of
their side chains. They are broadly classified into two
groups, charged and uncharged. Each of these groups is
divided into subgroups to classify the amino acids more
accurately:



A. Charged Amino Acids

Acidic Residues:      Asp, Glu
Basic Residues:       Lys, Arg, His, Orn

B. Uncharged Amino Acids

Hydrophilic Residues: Ser, Thr, Asn, Gln
Aliphatic Residues:   Gly, Ala, Val, Leu, Ile, Nle
Non-polar Residues:   Cys, Met, Pro, Hcy
Aromatic Residues:    Phe, Tyr, Trp
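
For readers who wish to apply this classification programmatically, it can be written as a simple lookup table. The sketch below merely restates the groups listed above; the Python names `SIDE_CHAIN_CLASSES` and `classify` are illustrative and do not appear in the specification.

```python
# Side-chain subgroups exactly as listed in the classification above,
# using the three-letter designations of the specification.
SIDE_CHAIN_CLASSES = {
    "acidic": ["Asp", "Glu"],
    "basic": ["Lys", "Arg", "His", "Orn"],
    "hydrophilic": ["Ser", "Thr", "Asn", "Gln"],
    "aliphatic": ["Gly", "Ala", "Val", "Leu", "Ile", "Nle"],
    "non-polar": ["Cys", "Met", "Pro", "Hcy"],
    "aromatic": ["Phe", "Tyr", "Trp"],
}

# Subgroups that belong to group A (charged); all others are group B.
CHARGED_SUBGROUPS = {"acidic", "basic"}


def classify(residue):
    """Return (group, subgroup) for a three-letter residue code."""
    for subgroup, members in SIDE_CHAIN_CLASSES.items():
        if residue in members:
            group = "charged" if subgroup in CHARGED_SUBGROUPS else "uncharged"
            return group, subgroup
    raise ValueError(f"unknown residue: {residue!r}")


if __name__ == "__main__":
    print(classify("Orn"))
    print(classify("Nle"))
```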

The terms "amino acid alteration" and "alteration" refer 
to amino acid substitutions, deletions or insertions or 
any combinations thereof in a CTLD amino acid sequence. 

In the CTLD variants of the present invention such
alteration is at a site or sites of a CTLD amino acid
sequence. Substitutional variants herein are those that
have at least one amino acid residue in a native CTLD
sequence removed and a different amino acid inserted in its
place at the same position. The substitutions may be
single, where only one amino acid in the molecule has been
substituted, or they may be multiple, where two or more
amino acids have been substituted in the same molecule.

The designation of the substitution variants herein
consists of a letter followed by a number followed by a
letter. The first (leftmost) letter designates the amino
acid in the native (unaltered) CTLD or CTLD-containing
protein. The number refers to the amino acid position
where the amino acid substitution is being made, and the
second (righthand) letter designates the amino acid that
is used to replace the native amino acid. As mentioned
above, the numbering starts with "1" designating the
N-terminal amino acid of the CTLD or the CTLD-containing
protein, as the case may be. Multiple alterations are
separated by a comma (,) in the notation for ease of
reading them.
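
The notation just described (letter, number, letter, with commas between multiple alterations) is regular enough to be read mechanically. The following is a minimal sketch of such a reader; the function name `parse_substitutions` and the example string are illustrative assumptions, and the set of accepted letters is taken from the single-letter designations tabulated above.

```python
import re

# Single-letter designations from the table in this section,
# including J (Nle), U (Hcy), O (Orn) and X (any L-alpha-amino acid).
ONE_LETTER_CODES = set("DTSEPGACVMJUILYFHKRWQNOX")

# letter, position number, letter (e.g. N109E)
_SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_substitutions(notation):
    """Parse 'N109E,G129A' into [(native, position, replacement), ...]."""
    parsed = []
    for item in notation.split(","):
        item = item.strip()
        match = _SUBSTITUTION.match(item)
        if match is None:
            raise ValueError(f"not a substitution designation: {item!r}")
        native, position, replacement = match.groups()
        if native not in ONE_LETTER_CODES or replacement not in ONE_LETTER_CODES:
            raise ValueError(f"unknown amino acid letter in {item!r}")
        if int(position) < 1:
            raise ValueError(f"numbering starts at 1: {item!r}")
        parsed.append((native, int(position), replacement))
    return parsed


if __name__ == "__main__":
    # Hypothetical example string in the notation described above.
    print(parse_substitutions("N109E, G129A"))
```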

The terms "nucleic acid molecule encoding", "DNA sequence
encoding", and "DNA encoding" refer to the order or
sequence of deoxyribonucleotides along a strand of
deoxyribonucleic acid. The order of these deoxyribonucleotides
determines the order of amino acids along the polypeptide
chain. The DNA sequence thus encodes the amino acid
sequence.

The terms "mutationally randomised sequence", "randomised
polypeptide segment", "randomised amino acid sequence" and
"randomised oligonucleotide", as well as any similar terms
used in any context to refer to randomised sequences,
polypeptides or nucleic acids, refer to ensembles of
polypeptide or nucleic acid sequences or segments, in which
the amino acid residue or nucleotide at one or more
sequence positions may differ between different members of
the ensemble of polypeptides or nucleic acids, such that
the amino acid residue or nucleotide occurring at each such
sequence position may belong to a set of amino acid
residues or nucleotides that may include all possible amino
acid residues or nucleotides or any restricted subset
thereof. Said terms are often used to refer to ensembles in
which the number of amino acid residues or nucleotides is
the same for each member of the ensemble, but may also be
used to refer to such ensembles in which the number of
amino acid residues or nucleotides in each member of the
ensemble may be any integer number within an appropriate
range of integer numbers.

II. Construction and utility of combinatorial CTLD libraries

Several systems displaying phenotype, in terms of putative
ligand binding modules or modules with putative enzymatic
activity, have been described. These include:
phage display (e.g. the filamentous phage fd [Dunn
(1996), Griffiths and Duncan (1998), Marks et al.
(1992)], phage lambda [Mikawa et al. (1996)]), display on
eukaryotic virus (e.g. baculovirus [Ernst et al. (2000)]),
cell display (e.g. display on bacterial cells [Benhar et
al. (2000)], yeast cells [Boder and Wittrup (1997)], and
mammalian cells [Whitehorn et al. (1995)]), ribosome
linked display [Schaffitzel et al. (1999)], and plasmid
linked display [Gates et al. (1996)].

The most commonly used method for phenotype display and
linking this to genotype is by phage display. This is
accomplished by insertion of the reading frame encoding the
scaffold protein or protein of interest into an
intra-domain segment of a surface exposed phage protein. The
filamentous phage fd (e.g. M13) has proven most useful
for this purpose. Polypeptides, protein domains, or
proteins are most frequently inserted either between the
"export" signal and domain 1 of the fd gene III protein
or into a so-called hinge region between domain 2 and
domain 3 of the fd-phage gene III protein. Human antibodies
are the most frequently used proteins for the isolation
of new binding units, but other proteins and domains have
also been used (e.g. human growth hormone [Bass et al.
(1990)], alkaline phosphatase [McCafferty et al. (1991)],
β-lactamase inhibitory protein [Huang et al. (2000)], and
cytotoxic T lymphocyte-associated antigen 4 [Hufton et
al. (2000)]). The antibodies are often expressed and
presented as scFv or Fab fusion proteins. Three strategies
have been employed. Either a specific antibody is used as
a scaffold for generating a library of mutationally
randomised sequences within the antigen binding clefts [e.g.
Fuji et al. (1998)] or libraries representing large
ensembles of human antibody encoding genes from
non-immunised hosts [e.g. Nissim et al. (1994)] or from
immunised hosts [e.g. Cyr and Hudspeth (2000)] are cloned
into the fd phage vector.

The general procedure for accomplishing the generation of
a display system for the generation of CTLD libraries
comprises essentially

(1) identification of the location of the loop-region,
by referring to the 3D structure of the CTLD of
choice, if such information is available, or, if not,
identification of the sequence locations of the β2-,
β3- and β4-strands by sequence alignment with the
sequences shown in Fig. 1, as aided by the further
corroboration by identification of sequence elements
corresponding to the β2 and β3 consensus sequence
elements and β4-strand characteristics, also
disclosed above;

(2) subcloning of a nucleic acid fragment encoding
the CTLD of choice in a protein display vector
system with or without prior insertion of
endonuclease restriction sites close to the sequences
encoding β2, β3 and β4; and

(3) substituting the nucleic acid fragment encoding
some or all of the loop-region of the CTLD of
choice with randomly selected members of an
ensemble consisting of a multitude of nucleic acid
fragments which after insertion into the nucleic
acid context encoding the receiving framework
will substitute the nucleic acid fragment encoding
the original loop-region polypeptide fragments
with randomly selected nucleic acid fragments.
Each of the cloned nucleic acid fragments,
encoding a new polypeptide replacing an original
loop-segment or the entire loop-region, will be
decoded in the reading frame determined within
its new sequence context.

Nucleic acid fragments may be inserted in specific
locations into receiving nucleic acids by any common method
of molecular cloning of nucleic acids, such as by
appropriately designed PCR manipulations in which chemically
synthesized nucleic acids are copy-edited into the
receiving nucleic acid, in which case no endonuclease
restriction sites are required for insertion. Alternatively,
the insertion/excision of nucleic acid fragments
may be facilitated by engineering appropriate combinations
of endonuclease restriction sites into the target
nucleic acid into which suitably designed oligonucleotide
fragments may be inserted using standard methods of
molecular cloning of nucleic acids.

It will be apparent that interesting CTLD variants
isolated from CTLD libraries in which restriction
endonuclease sites have been inserted for convenience may
contain mutated or additional amino acid residues that
neither correspond to residues present in the original CTLD
nor are important for maintaining the interesting new
affinity of the CTLD variant. If desirable, e.g. in case the
product needs to be rendered as non-immunogenic as
possible, such residues may be altered or removed by
back-mutation or deletion in the specific clone, as
appropriate.

The ensemble consisting of a multitude of nucleic acid
fragments may be obtained by ordinary methods for chemical
synthesis of nucleic acids by directing the step-wise
synthesis to add pre-defined combinations of pure
nucleotide monomers or a mixture of any combination of
nucleotide monomers at each step in the chemical synthesis of
the nucleic acid fragment. In this way it is possible to
generate any level of sequence degeneracy, from one
unique nucleic acid sequence to the most complex mixture,
which will represent a complete or incomplete
representation of the maximum number of unique sequences, 4^N,
where N is the number of nucleotides in the sequence.
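
The number of distinct sequences that a given synthesis scheme can produce is the product, over all positions, of the number of monomers admitted at that position, with 4^N as the upper limit. The sketch below computes this for an oligonucleotide written in the standard IUPAC ambiguity codes. The use of IUPAC notation and the function name `degeneracy` are assumptions made for illustration; the specification does not prescribe any particular notation.

```python
# Standard IUPAC nucleotide ambiguity codes: each symbol stands for the
# set of pure monomers that may be incorporated at that synthesis step.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def degeneracy(oligo):
    """Number of unique sequences encoded by a degenerate oligonucleotide."""
    count = 1
    for symbol in oligo.upper():
        count *= len(IUPAC[symbol])
    return count


if __name__ == "__main__":
    print(degeneracy("AGATCT"))   # one unique sequence -> 1
    print(degeneracy("NNNNNN"))   # fully randomised, 4**6 -> 4096
    print(degeneracy("NNK"))      # restricted third position -> 32
```

A fully randomised stretch of N positions reaches the maximum of 4^N, while restricting the monomer mixture at selected steps gives any intermediate level of degeneracy.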

Complex ensembles consisting of multitudes of nucleic
acid fragments may, alternatively, be prepared by
generating mixtures of nucleic acid fragments by chemical,
physical or enzymatic fragmentation of high-molecular
mass nucleic acid compositions like, e.g., genomic
nucleic acids extracted from any organism. To render such
mixtures of nucleic acid fragments useful in the
generation of molecular ensembles, as described here, the crude
mixtures of fragments, obtained in the initial cleavage
step, would typically be size-fractionated to obtain
fragments of an approximate molecular mass range which
would then typically be adjoined to a suitable pair of
linker nucleic acids, designed to facilitate insertion of
the linker-embedded mixtures of size-restricted
oligonucleotide fragments into the receiving nucleic acid
vector.

To facilitate the construction of combinatorial CTLD
libraries in tetranectin, the model CTLD of the preferred
embodiment of the invention, suitable restriction sites
located in the vicinity of the nucleic acid sequences
encoding β2, β3 and β4 in both human and murine tetranectin
were designed with minimal perturbation of the
polypeptide sequence encoded by the altered sequences. It was
found possible to establish a design strategy, as
detailed below, by which identical endonuclease restriction
sites could be introduced at corresponding locations in
the two sequences, allowing interesting loop-region
variants to be readily excised from a recombinant murine CTLD
and inserted correctly into the CTLD framework of human
tetranectin or vice versa.

Analysis of the nucleotide sequence encoding the mature
form of human tetranectin reveals (Fig. 2) that a
recognition site for the restriction endonuclease Bgl II is
found at position 326 to 331 (AGATCT), involving the
encoded residues Glu109, Ile110, and Trp111 of β2, and that
a recognition site for the restriction endonuclease Kas I
is found at position 382 to 387 (GGCGCC), involving the
encoded amino acid residues Gly128 and Ala129 (located
C-terminally in loop 2).
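
The correspondence between the nucleotide positions and the residue numbers quoted above follows from simple codon arithmetic. The sketch below assumes that nucleotide 1 is the first base of the codon for residue 1 of the mature protein, which is consistent with the positions and residues given in the text. The function names and the short test sequence are illustrative only and are not taken from the tetranectin sequence.

```python
def codon_number(nucleotide_position):
    """Map a 1-based nucleotide position to its 1-based codon number."""
    if nucleotide_position < 1:
        raise ValueError("positions are 1-based")
    return (nucleotide_position - 1) // 3 + 1


def residues_spanned(first, last):
    """Residue numbers whose codons overlap nucleotides first..last."""
    return list(range(codon_number(first), codon_number(last) + 1))


def find_sites(sequence, recognition_sequence):
    """Return 1-based (start, end) positions of every occurrence."""
    sequence = sequence.upper()
    site = recognition_sequence.upper()
    hits = []
    start = sequence.find(site)
    while start != -1:
        hits.append((start + 1, start + len(site)))
        start = sequence.find(site, start + 1)
    return hits


if __name__ == "__main__":
    print(residues_spanned(326, 331))   # Bgl II site -> [109, 110, 111]
    print(residues_spanned(382, 387))   # Kas I site  -> [128, 129]
    # Toy sequence (hypothetical) containing one Bgl II site.
    print(find_sites("TTAGATCTAA", "AGATCT"))   # -> [(3, 8)]
```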

Mutation, by site directed mutagenesis, of G513 to A and
of C514 to T in the nucleotide sequence encoding human
tetranectin would introduce a Mun I restriction
endonuclease recognition site therein, located at position 511
to 516, and mutation of G513 to A in the nucleotide
sequence encoding murine tetranectin would introduce a Mun
I restriction endonuclease site therein at a position
corresponding to the Mun I site in human tetranectin,
without affecting the amino acid sequence of either of
the encoded protomers. Mutation, by site directed
mutagenesis, of C327 to G and of G386 to C in the
nucleotide sequence encoding murine tetranectin would introduce
a Bgl II and a Kas I restriction endonuclease recognition
site, respectively, therein. Additionally, A325 in the
nucleotide sequence encoding murine tetranectin is
mutagenized to a G. These three mutations would affect
the encoded amino acid sequence by substitution of Asn109
to Glu and Gly129 to Ala, respectively. Now, the
restriction endonuclease Kas I is known to exhibit marked site
preference and cleaves only slowly the tetranectin coding
region. Therefore, a recognition site for another
restriction endonuclease substituting the Kas I site is
preferred (e.g. the recognition site for the restriction
endonuclease Kpn I, recognition sequence GGTACC). The
nucleotide and amino acid sequences of the resulting
tetranectin derivatives, human tetranectin lectin (htlec)
and murine tetranectin lectin (mtlec) are shown in Fig.
3. The nucleotide sequences encoding the htlec and mtlec
protomers may readily be subcloned into devices enabling
protein display of the linked nucleotide sequence (e.g.
phagemid vectors) and into plasmids designed for
heterologous expression of protein [e.g. pT7H6, Christensen
et al. (1991)]. Other derivatives encoding only the
mutated CTLDs of either htlec or mtlec (htCTLD and mtCTLD,
respectively) have also been constructed and subcloned
into phagemid vectors and expression plasmids, and the
nucleotide and amino acid sequences of these CTLD
derivatives are shown in Fig. 4.
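
The consequences of the point mutations described in this paragraph can be checked with a few lines of code. In the sketch below the short sequence windows are not copied from a sequence listing: they are inferred from the stated mutations and from the recognition sequences they are said to create (Mun I, CAATTG; Bgl II, AGATCT; Kas I, GGCGCC), and should be treated as assumptions. The helper names are likewise illustrative. Translation uses the standard genetic code.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def mutate(window, window_start, changes):
    """Apply point mutations {position: (old, new)} to a sequence window.

    window_start is the 1-based position of the first base of the window.
    """
    bases = list(window)
    for position, (old, new) in changes.items():
        index = position - window_start
        if bases[index] != old:
            raise ValueError(f"expected {old} at position {position}")
        bases[index] = new
    return "".join(bases)


def translate(window):
    """Translate a window that starts on a codon boundary."""
    return "".join(CODON_TABLE[window[i:i + 3]]
                   for i in range(0, len(window), 3))


if __name__ == "__main__":
    # Human, positions 511-516 (inferred window): G513A and C514T.
    human = mutate("CAGCTG", 511, {513: ("G", "A"), 514: ("C", "T")})
    print(human, translate("CAGCTG"), translate(human))   # CAATTG QL QL

    # Murine codon 109, positions 325-327 (inferred): A325G and C327G.
    codon_109 = mutate("AAC", 325, {325: ("A", "G"), 327: ("C", "G")})
    print(codon_109, translate("AAC"), translate(codon_109))   # GAG N E

    # Murine, positions 382-387 (inferred window): G386C.
    kas = mutate("GGCGGC", 382, {386: ("G", "C")})
    print(kas, translate("GGCGGC"), translate(kas))   # GGCGCC GG GA
```

Under these assumptions the Mun I mutations are silent (Gln-Leu is retained), whereas the Bgl II and Kas I mutations give the Asn109 to Glu and Gly129 to Ala substitutions stated in the text.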

The presence of a common set of recognition sites for the
restriction endonucleases Bgl II, Kas I or Kpn I, and Mun
I in the ensemble of tetranectin and CTLD derivatives
allows for the generation of protein libraries with
randomised amino acid sequence in one or more of the loops and
at single residue positions in β4 comprising the lectin
ligand binding region by ligation of randomised
oligonucleotides into properly restricted phagemid vectors
encoding htlec, mtlec, htCTLD, or mtCTLD derivatives.

After rounds of selection on specific targets (e.g.
eukaryotic cells, virus, bacteria, specific proteins,
polysaccharides, other polymers, organic compounds etc.) DNA
may be isolated from the specific phages, and the
nucleotide sequence of the segments encoding the ligand-binding
region determined, excised from the phagemid DNA and
transferred to the appropriate derivative expression
vector for heterologous production of the desired product.
Heterologous production in a prokaryote may be preferred
because an efficient protocol for the isolation and
refolding of tetranectin and derivatives has been reported
(International Patent Application Publication WO 94/18227
A2).

A particular advantage gained by implementing the
technology of the invention, using tetranectin as the
scaffold structure, is that the structures of the murine and
human tetranectin scaffolds are almost identical,
allowing loop regions to be swapped freely between murine and
human tetranectin derivatives with retention of
functionality. Swapping of loop regions between the murine and
the human framework is readily accomplished within the
described system of tetranectin derivative vectors, and
it is anticipated that the system can be extended to
include other species (e.g. rat, old and new world monkeys,
dog, cattle, sheep, goat etc.) of relevance in medicine
or veterinary medicine in view of the high level of
homology between man and mouse sequences, even at the
genetic level. Extension of this strategy to include more
species may be rendered possible as and when tetranectin
is eventually cloned and/or sequenced from such species.

Because the C-type lectin ligand-binding region
represents a different topological unit compared to the
antigen binding clefts of the antibodies, we envisage that
the selected binding specificities will be of a different
nature compared to the antibodies. Further, we envisage
that the tetranectin derivatives may have advantages
compared to antibodies with respect to specificity in
binding sugar moieties or polysaccharides. The tetranectin
derivatives may also be advantageous in selecting binding
specificities against certain natural or synthetic
organic compounds.

Several CTLDs are known to bind calcium ions, and binding
of other ligands is often either dependent on calcium
(e.g. the collectin family of C-type lectins, where the
calcium ion bound in site 2 is directly involved in
binding the sugar ligand [Weis and Drickamer (1996)]) or
sensitive to calcium (e.g. tetranectin, where binding of
calcium involves more of the side chains known otherwise
to be involved in plasminogen kringle 4 binding
[Graversen et al. (1998)]). The calcium binding sites
characteristic of the C-type lectin-like protein family are
comprised by residues located in loop 1, loop 4 and
β-strand 4 and are dependent on the presence of a proline
residue (often interspacing loop 3 and loop 4 in the
structure), which upon binding is found invariantly in
the cis conformation. Moreover, binding of calcium is
known to enforce structural changes in the CTLD
loop-region [Ng et al. (1998a, b)]. We therefore envisage that
binding to a specific target ligand by members of
combinatorial libraries with preserved CTLD metal binding
sites may be modulated by addition or removal of divalent
metal ions (e.g. calcium ions) either because the metal
ion may be directly involved in binding, because it is a
competitive ligand, or because binding of the metal ion
enforces structural rearrangements within the putative
binding site.

The trimeric nature of several members of the C-type
lectin and C-type lectin-like protein family, including
tetranectin, and the accompanying avidity in binding may
also be exploited in the creation of binding units with
very high binding affinity.

As can be appreciated from the disclosure above, the
present invention has a broad general scope and a wide area
of application. Accordingly, the following examples,
describing various embodiments thereof, are offered by way
of illustration only, not by way of limitation.

Example 1

Construction of tetranectin derived E. coli expression
plasmids and phagemids

The expression plasmid pT7H6FX-htlec, encoding the
FX-htlec (SEQ ID NO: 01) part of the full length H6FX-htlec
fusion protein, was constructed by a series of four
consecutive site-directed mutagenesis experiments starting
from the expression plasmid pT7H6-rTN 123 [Holtet et al.
(1997)] using the QuikChange™ Site-Directed Mutagenesis
Kit (Stratagene, La Jolla, CA) and performed as described
by the manufacturer. Mismatching primer pairs introducing
the desired mutations were supplied by DNA Technology
(Aarhus, Denmark). An outline of the resulting
pT7H6FX-htlec expression plasmid is shown in Fig. 5, and the
nucleotide sequence of the FX-htlec encoding insert is
given as SEQ ID NO: 01. The amino acid sequence of the
FX-htlec part of the H6FX-htlec fusion protein is shown in
Fig. 6 and given as SEQ ID NO: 02.

The expression plasmid pT7H6FX-htCTLD, encoding the FX-htCTLD
(SEQ ID NO: 03) part of the H6FX-htCTLD fusion protein, was
constructed by amplification and subcloning into the plasmid
pT7H6 (i.e. amplification in a polymerase chain reaction using
the expression plasmid pT7H6FX-htlec as template, and otherwise
the primers, conditions, and subcloning procedure described for
the construction of the expression plasmid pT7H6TN3 [Holtet et
al. (1997)]). An outline of the resulting pT7H6FX-htCTLD
expression plasmid is shown in Fig. 7, and the nucleotide
sequence of the FX-htCTLD encoding insert is given as SEQ ID
NO: 03. The amino acid sequence of the FX-htCTLD part of the
H6FX-htCTLD fusion protein is shown in Fig. 8 and given as SEQ
ID NO: 04.

The phagemids pPhTN and pPhTN3 were constructed by ligation
of the Sfi I and Not I restricted DNA fragments amplified from
the expression plasmids pT7H6-rTN 123 (with the oligonucleotide
primers
5'-CGGCTGAGCGGCCCAGCCGGCCATGGCCGAGCCACCAACCCAGAAGC-3'
[SEQ ID NO: 05] and 5'-CCTGCGGCCGCCACGATCCCGAACTGG-3'
[SEQ ID NO: 06]) and pT7H6FX-htCTLD (with the oligonucleotide
primers 5'-CGGCTGAGCGGCCCAGCCGGCCATGGCCGCCCTGCAGACGGTC-3'
[SEQ ID NO: 07] and 5'-CCTGCGGCCGCCACGATCCCGAACTGG-3'
[SEQ ID NO: 06]), respectively, into a Sfi I and Not I precut
vector, pCANTAB 5E, supplied by Amersham Pharmacia Biotech
(code no. 27-9401-01), using standard procedures. Outlines of
the resulting pPhTN and pPhTN3 phagemids are shown in Fig. 9
and Fig. 11, respectively, and the nucleotide sequences of the
PhTN and PhTN3 inserts are given as SEQ ID NO: 08 and SEQ ID
NO: 10, respectively. The amino acid sequences encoded by the
PhTN and PhTN3 inserts are shown in Fig. 10 (SEQ ID NO: 09) and
Fig. 12 (SEQ ID NO: 11), respectively.
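
The cloning strategy above relies on each forward primer carrying
a Sfi I site and the shared reverse primer carrying a Not I site.
A minimal sketch of how this could be checked is given below. The
recognition sequences used (GGCCNNNNNGGCC for Sfi I and GCGGCCGC
for Not I) are the generally known ones and are not stated in this
document; the helper name find_sites is hypothetical.

```python
import re

# Recognition sequences written as regular expressions (N = any base).
# These are general knowledge, not taken from the text above.
SITES = {
    "SfiI": "GGCC[ACGT]{5}GGCC",
    "NotI": "GCGGCCGC",
}

# Primer sequences as listed in Example 1.
PRIMERS = {
    "SEQ ID NO: 05": "CGGCTGAGCGGCCCAGCCGGCCATGGCCGAGCCACCAACCCAGAAGC",
    "SEQ ID NO: 06": "CCTGCGGCCGCCACGATCCCGAACTGG",
    "SEQ ID NO: 07": "CGGCTGAGCGGCCCAGCCGGCCATGGCCGCCCTGCAGACGGTC",
}


def find_sites(seq):
    """Return {enzyme: [0-based start positions]} for sites found in seq."""
    hits = {}
    for enzyme, pattern in SITES.items():
        # A lookahead is used so that overlapping sites are also reported.
        starts = [m.start() for m in re.finditer(f"(?={pattern})", seq)]
        if starts:
            hits[enzyme] = starts
    return hits


for name, seq in PRIMERS.items():
    print(name, find_sites(seq))
```

The same scan could be applied to the primers used for the other
phagemids in this example, since they share the same 5' tails.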

The phagemids pPhtlec and pPhtCTLD were constructed by
ligation of the Sfi I and Not I restricted DNA fragments
amplified from the expression plasmids pT7H6FX-htlec (with the
oligonucleotide primers
5'-CGGCTGAGCGGCCCAGCCGGCCATGGCCGAGCCACCAACCCAGAAGC-3'
[SEQ ID NO: 05] and 5'-CCTGCGGCCGCCACGATCCCGAACTGG-3'
[SEQ ID NO: 06]) and pT7H6FX-htCTLD (with the oligonucleotide
primers 5'-CGGCTGAGCGGCCCAGCCGGCCATGGCCGCCCTGCAGACGGTC-3'
[SEQ ID NO: 07] and 5'-CCTGCGGCCGCCACGATCCCGAACTGG-3'
[SEQ ID NO: 06]), respectively, into a Sfi I and Not I precut
vector, pCANTAB 5E, supplied by Amersham Pharmacia Biotech
(code no. 27-9401-01), using standard procedures. Outlines of
the resulting pPhtlec and pPhtCTLD phagemids are shown in
Fig. 13 and Fig. 15, respectively, and the nucleotide sequences
of the Phtlec and PhtCTLD inserts are given as SEQ ID NO: 12
and SEQ ID NO: 14, respectively. The amino acid sequences
encoded by the Phtlec and PhtCTLD inserts are shown in Fig. 14
(SEQ ID NO: 13) and Fig. 16 (SEQ ID NO: 15), respectively.

A plasmid clone, pUC-mtlec, containing the nucleotide sequence
corresponding to the murine tetranectin derivative mtlec
(Fig. 3 and SEQ ID NO: 16) was constructed by four successive
subclonings of DNA subfragments in the following way: First,
two oligonucleotides
5'-CGGAATTCGAGTCACCCACTCCCAAGGCCAAGAAGGCTGCAAATGCCAAGAAA-
GATTTGGTGAGCTCAAAGATGTTC-3' (SEQ ID NO: 17) and 5'-GCG-
GATCCAGGCCTGCTTCTCCTTCAGCAGGGCCACCTCCTGGGCCAGGACATCCAT-
CCTGTTCTTGAGCTCCTCGAACATCTTTGAGCTCACC-3' (SEQ ID NO: 18)
were annealed and, after a filling in reaction, cut with the
restriction endonucleases Eco RI (GAATTC) and Bam HI (GGATCC)
and ligated into Eco RI and Bam HI precut pUC18 plasmid DNA.
Second, a pair of oligonucleotides 5'-GCA-
GGCCTTACAGACTGTGTGCCTGAAGGGCACCAAGGTGAACTTGAAGTGCCTCCT-
GGCCTTCACCCAACCGAAGACCTTCCATGAGGCGAGCGAG-3' (SEQ ID NO: 19)
and 5'-CCGCATGCTTCGAACAGCGCCTCGTTCTCTAGCTCTGAC-
TGCGGGGTGCCCAGCGTGCCCCCTTGCGAGATGCAGTCCTCGCTCGCCTCATGG-3'
(SEQ ID NO: 20) was annealed and, after a filling in reaction,
cut with the restriction endonucleases Stu I (AGGCCT) and Sph I
(GCATGC) and ligated into the Stu I and Sph I precut plasmid
resulting from the first ligation. Third, an oligonucleotide
pair 5'-GGTTCGAATACGCGC-
GCCACAGCGTGGGCAACGATGCGGAGATCTAAATGCTCCCAATTGC-3' (SEQ ID
NO: 21) and 5'-CCAAGCTTCACAATGGCAAACTGGCAGATGTAGGGCAATTGG-
GAGCATTTAGATC-3' (SEQ ID NO: 22) was annealed and, after a
filling in reaction, cut with the restriction endonucleases
BstB I (TTCGAA) and Hind III (AAGCTT) and ligated into the
BstB I and Hind III precut plasmid resulting from the second
ligation. Fourth, an oligonucleotide pair
5'-CGGAGATCTGGCTGGGCCTCAACGACATGGCCGCGGAAGGCGCCTGGGTGGA-
CATGACCGGTACCCTCCTGGCCTACAAGAACTGG-3' (SEQ ID NO: 23) and
5'-GGGCAATTGATCGCGGCATCGCTTGTCGAACCTCTTGCCGTTGGCTGCGCCAG-
ACAGGGCGGCGCAGTTCTCGGCTTTGCCGCCGTCGGGTTGCGTCGTGATCTCCGTC-
TCCCAGTTCTTGTAGGCCAGG-3' (SEQ ID NO: 24) was annealed and,
after a filling in reaction, cut with the restriction
endonucleases Bgl II (AGATCT) and Mun I (CAATTG) and ligated
into the Bgl II and Mun I precut plasmid resulting from the
third ligation. An outline of the pUC-mtlec plasmid is shown in
Fig. 17, and the resulting nucleotide sequence of the Eco RI to
Hind III insert is given as SEQ ID NO: 16.
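
Each subcloning step above anneals two long oligonucleotides by
their complementary 3' ends before the filling in reaction. A
minimal sketch of how the length of that 3' overlap could be
computed for the first oligonucleotide pair (SEQ ID NO: 17 and
SEQ ID NO: 18) follows. The helper names revcomp and
three_prime_overlap are hypothetical and not part of the
described procedure.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def three_prime_overlap(top, bottom):
    """Longest k such that the last k bases of `top` pair with the
    last k bases of `bottom` (both given 5'->3')."""
    rc = revcomp(bottom)
    for k in range(min(len(top), len(rc)), 0, -1):
        if top[-k:] == rc[:k]:
            return k
    return 0


# First oligonucleotide pair of Example 1, joined across line breaks.
SEQ17 = ("CGGAATTCGAGTCACCCACTCCCAAGGCCAAGAAGGCTGCAAATGCCAAGAAA"
         "GATTTGGTGAGCTCAAAGATGTTC")
SEQ18 = ("GCG"
         "GATCCAGGCCTGCTTCTCCTTCAGCAGGGCCACCTCCTGGGCCAGGACATCCAT"
         "CCTGTTCTTGAGCTCCTCGAACATCTTTGAGCTCACC")

k = three_prime_overlap(SEQ17, SEQ18)
print("overlap length:", k)
print("overlap (top strand):", SEQ17[-k:])
```

The same function could be run on the other three oligonucleotide
pairs to confirm that each pair can prime its own fill-in.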

The expression plasmids pT7H6FX-mtlec and pT7H6FX-mtCTLD may
be constructed by ligation of the Bam HI and Hind III
restricted DNA fragments, amplified from the pUC-mtlec plasmid
with the oligonucleotide primer pair 5'-CTGGGATCC-
ATCCAGGGTCGCGAGTCACCCACTCCCAAGG-3' (SEQ ID NO: 25) and
5'-CCGAAGCTTACACAATGGCAAACTGGC-3' (SEQ ID NO: 26), and with
the oligonucleotide primer pair 5'-CTGGGATCCATCCAGGGTCG-
CGCCTTACAGACTGTGGTC-3' (SEQ ID NO: 27) and
5'-CCGAAGCTTACACAATGGCAAACTGGC-3' (SEQ ID NO: 26),
respectively, into Bam HI and Hind III precut pT7H6 vector
using standard procedures. Outlines of the expression plasmids
pT7H6FX-mtlec and pT7H6FX-mtCTLD are shown in Fig. 18 and
Fig. 20, respectively, and the nucleotide sequences of the
FX-mtlec and FX-mtCTLD inserts are given as SEQ ID NO: 28 and
SEQ ID NO: 30, respectively. The amino acid sequences of the
FX-mtlec and FX-mtCTLD parts of the H6FX-mtlec and H6FX-mtCTLD
fusion proteins are shown in Fig. 19 (SEQ ID NO: 29) and
Fig. 21 (SEQ ID NO: 31), respectively.

The phagemids pPmtlec and pPmtCTLD may be constructed by
ligation of the Sfi I and Not I restricted DNA fragments
(amplified from the pUC-mtlec plasmid with the oligonucleotide
primer pair
5'-CGGCTGAGCGGCCCAGCCGGCCATGGCCGAGTCACCCACTCCCAAGG-3'
[SEQ ID NO: 32] and 5'-CCTGCGGCCGCCACGATCCCGAACTGG-3'
[SEQ ID NO: 33], and with the oligonucleotide primers
5'-CGGCTGAGCGGCCCAGCCGGCCATGGCCGCCTTACAGACTGTGGTC-3'
[SEQ ID NO: 34] and 5'-CCTGCGGCCGCCACGATCCCGAACTGG-3'
[SEQ ID NO: 33], respectively) into a Sfi I and Not I precut
vector, pCANTAB 5E, supplied by Amersham Pharmacia Biotech
(code no. 27-9401-01), using standard procedures. Outlines of
the pPmtlec and pPmtCTLD plasmids are shown in Fig. 22 and
Fig. 24, respectively, and the resulting nucleotide sequences
of the Pmtlec and PmtCTLD inserts are given as SEQ ID NO: 35
and SEQ ID NO: 37, respectively. The amino acid sequences
encoded by the Pmtlec and PmtCTLD inserts are shown in Fig. 23
(SEQ ID NO: 36) and Fig. 25 (SEQ ID NO: 38), respectively.

Example 2

Demonstration of successful display of Phtlec and PhTN3 
on phages. 

In order to verify that the Phtlec and PhTN3 Gene III fusion
proteins can indeed be displayed by the recombinant phage
particles, the phagemids pPhtlec and pPhTN3 (described in
Example 1) were transformed into E. coli TG1 cells and
recombinant phages produced upon infection with the helper
phage M13K07. Recombinant phages were isolated by precipitation
with poly(ethylene glycol) (PEG 8000), and samples of Phtlec
and PhTN3 phage preparations as well as a sample of helper
phage were subjected to an ELISA-type sandwich assay, in which
wells of a Maxisorb (Nunc) multiwell plate were first incubated
with anti-human tetranectin or bovine serum albumin (BSA) and
blocked in skimmed milk or skimmed milk/EDTA. Briefly, cultures
of pPhtlec and pPhTN3 phagemid transformed TG1 cells were grown
at 37 °C in 2xTY-medium supplemented with 2% glucose and
100 mg/L ampicillin until A600 reached 0.5. The helper phage,
M13K07, was then added to a concentration of 5x10* pfu/mL. The
cultures were incubated at 37 °C for another 30 min before
cells were harvested by centrifugation, resuspended in the same
culture volume of 2xTY medium supplemented with 50 mg/L
kanamycin and 100 mg/L ampicillin, transferred to a fresh set
of flasks and grown for 16 hours at 25 °C. Cells were removed
by centrifugation and the phages precipitated from 20 mL
culture supernatant by the addition of 6 mL of ice cold 20% PEG
8000, 2.5 M NaCl. After mixing, the solution was left on ice
for one hour and centrifuged at 4 °C to isolate the
precipitated phages. Each phage pellet was resuspended in 1 mL
of 10 mM Tris-HCl pH 8, 1 mM EDTA (TE) and incubated for 30 min
before centrifugation. The phage containing supernatant was
transferred to a fresh tube. Along with the preparation of
phage samples, the wells of a Maxisorb plate were coated
overnight with (70 µL) rabbit anti-human tetranectin (a
polyclonal antibody from DAKO A/S, code no. A0371) in a 1:2000
dilution or with (70 µL) BSA (10 mg/mL). Upon coating, the
wells were washed three times with PBS (2.68 mM KCl, 1.47 mM
KH2PO4, 137 mM NaCl, 8.10 mM Na2HPO4, pH 7.4) and blocked for
one hour at 37 °C with 280 µL of either 3% skimmed milk in PBS,
or 3% skimmed milk, 5 mM EDTA in PBS. Anti-tetranectin coated
and BSA coated wells were then incubated with human Phtlec-,
PhTN3-, or helper phage samples for 1 hour and then washed
three times in PBS buffer supplemented with the appropriate
blocking agent. Phages in the wells were detected after
incubation with HRP-conjugated anti-phage antibody (Amersham
Pharmacia, code no. 27-9421-01) followed by further washing.
HRP activities were then measured in a 96-well ELISA reader
using a standard HRP chromogenic substrate assay.
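
The precipitation step above mixes 20 mL of culture supernatant
with 6 mL of a 20% PEG 8000, 2.5 M NaCl stock. A minimal sketch
of the resulting final concentrations is given below, assuming
that the volumes are simply additive (an assumption of this
sketch, not a statement from the protocol). The function name
final_concentration is hypothetical.

```python
def final_concentration(stock_conc, stock_volume, sample_volume):
    """Concentration after adding stock_volume of stock to sample_volume,
    assuming additive volumes (same volume unit for both arguments)."""
    return stock_conc * stock_volume / (stock_volume + sample_volume)


# 6 mL of stock added to 20 mL of culture supernatant.
peg_percent = final_concentration(20.0, 6.0, 20.0)  # % PEG 8000
nacl_molar = final_concentration(2.5, 6.0, 20.0)    # M NaCl

print(f"PEG 8000: {peg_percent:.2f} %")
print(f"NaCl:     {nacl_molar:.3f} M")
```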

Phtlec and PhTN3 phages produced strong responses (14 times
background) in the assay, irrespective of the presence or
absence of EDTA in the blocking agent, whereas helper phage
produced no response above background readings in either
blocking agent. Only low binding to BSA was observed (Fig. 26).

It can therefore be concluded that the human Phtlec and 
PhTN3 phages both display epitopes that are specifically 
recognized by the anti-human tetranectin antibody. 



Example 3 

Demonstration of authentic ligand binding properties of
Phtlec and PhTN3 displayed on phage

The apo-form of the CTLD domain of human tetranectin binds in
a lysine-sensitive manner specifically to the kringle 4 domain
of human plasminogen [Graversen et al. (1998)]. Binding of
tetranectin to plasminogen can be inhibited by calcium, which
binds to two sites in the ligand-binding site in the CTLD
domain (Kd approx. 0.2 millimolar), or by lysine-analogues like
AMCHA (6-amino-cyclohexanoic acid), which bind specifically in
the two stronger lysine-binding sites in plasminogen, of which
one is located in kringle 1 and one is located in kringle 4
(Kd approx. 15 micromolar).



To demonstrate specific AMCHA-sensitive binding of human
Phtlec and PhTN3 phages to human plasminogen, an ELISA assay,
in outline similar to that employed to demonstrate the presence
of displayed Phtlec and PhTN3 Gene III fusion proteins on the
phage particles (cf. Example 2), was devised.

Wells were coated with solutions of human plasminogen
(10 µg/mL), with or without addition of 5 mM AMCHA. Control
wells were coated with BSA. Two identical arrays were
established: one was subjected to blocking of excess binding
capacity with 3% skimmed milk, and one was blocked using 3%
skimmed milk supplemented with 5 mM EDTA. Where appropriate,
blocking, washing and phage stock solutions were supplemented
with 5 mM AMCHA. The two arrays of wells were incubated with
either Phtlec-, PhTN3-, or helper phage samples, and after
washing the amount of phage bound in each well was measured
using the HRP-conjugated anti-phage antibody as above. The
results are shown in Fig. 27, panels A and B, and can be
summarized as follows:

(a) In the absence of AMCHA, binding of human Phtlec phages
    to plasminogen-coated wells generated responses at 8-10
    times background levels using either formulation of
    blocking agent, whereas human PhTN3 phages generated
    responses at 4 (absence of EDTA) or 7 (presence of EDTA)
    times background response levels.

(b) In the presence of 5 mM AMCHA, binding of human Phtlec
    and PhTN3 phages to plasminogen was found to be completely
    abolished.

(c) Phtlec and PhTN3 phages showed no binding to BSA, and
    control helper phages showed no binding to any of the
    immobilized substances.

(d) Specific binding of human Phtlec and PhTN3 phages to a
    specific ligand at moderate binding strength (about 20
    micromolar level) can be detected with high efficiency at
    virtually no background using a skimmed-milk blocking
    agent, well-known in the art of combinatorial phage
    technology as a preferred agent effecting the reduction of
    non-specific binding.

In conclusion, the results show that the Phtlec and PhTN3
Gene III fusion proteins displayed on the phage particles
exhibit plasminogen-binding properties corresponding to those
of authentic tetranectin, and that the physical and biochemical
properties of Phtlec and PhTN3 phages are compatible with their
proposed use as vehicles for the generation of combinatorial
libraries from which CTLD derived units with new binding
properties can be selected.

Example 4 

Construction of the phage libraries Phtlec-lb001 and
Phtlec-lb002.

All oligonucleotides used in this example were supplied by
DNA Technology (Aarhus, Denmark).

The phage library Phtlec-lb001, containing random amino acid
residues corresponding to Phtlec (SEQ ID NO: 12) positions
141-146 (loop 3), 150-153 (part of loop 4), and residue 168
(Phe in β4), was constructed by ligation of 20 µg KpnI and MunI
restricted pPhtlec phagemid DNA (cf. Example 1) with 10 µg of
KpnI and MunI restricted DNA fragment. This fragment was
amplified from the oligonucleotide htlec-lib1-tp (SEQ ID
NO: 39), which encodes the appropriately randomized nucleotide
sequence (N denotes a mixture of 25% of each of the nucleotides
T, C, G, and A, and S denotes a mixture of 50% of C and G),
with the oligonucleotides htlec-lib1-rev (SEQ ID NO: 40) and
htlec-lib1/2-fo (SEQ ID NO: 41) as primers using standard
conditions. The ligation mixture was used to transform
so-called electrocompetent E. coli TG-1 cells by
electroporation using standard procedures. After transformation
the E. coli TG-1 cells were plated on 2xTY-agar plates
containing 0.2 mg ampicillin/mL and 2% glucose and incubated
overnight at 30 °C.
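
The N and S mixtures defined above are commonly combined into NNS
triplets. Assuming that each randomized residue in Phtlec-lb001
is encoded by one NNS codon (an assumption of this sketch; the
oligonucleotide sequences themselves are not reproduced here),
the codon statistics and the theoretical diversity for the eleven
randomized positions (141-146, 150-153 and 168) could be
estimated as follows. The standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# N = T/C/G/A in equal parts, S = C/G in equal parts (as defined in the text).
nns_codons = ["".join(p) for p in product("TCGA", "TCGA", "CG")]
stop_codons = [c for c in nns_codons if CODON_TABLE[c] == "*"]
amino_acids = {CODON_TABLE[c] for c in nns_codons} - {"*"}

# 6 (loop 3) + 4 (part of loop 4) + 1 (residue 168) randomized positions.
n_positions = 6 + 4 + 1
dna_diversity = len(nns_codons) ** n_positions
protein_diversity = len(amino_acids) ** n_positions
# Fraction of clones without a stop codon in any randomized position.
stop_free_fraction = (1 - len(stop_codons) / len(nns_codons)) ** n_positions

print(len(nns_codons), "codons,", len(amino_acids), "amino acids,",
      "stop codons:", stop_codons)
print(f"DNA diversity:      {dna_diversity:.2e}")
print(f"protein diversity:  {protein_diversity:.2e}")
print(f"stop-free fraction: {stop_free_fraction:.2f}")
```

Under these assumptions the theoretical diversity far exceeds
what any practical number of transformants can sample, which is
the usual situation for libraries randomized at this many
positions.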

The phage library Phtlec-lb002, containing random amino acid
residues corresponding to Phtlec (SEQ ID NO: 12) positions
121-123, 125 and 126 (most of loop 1), and residues 150-153
(part of loop 4), was constructed by ligation of 20 µg BglII
and MunI restricted pPhtlec phagemid DNA (cf. Example 1) with
15 µg of BglII and MunI restricted DNA fragment. This fragment
was amplified from the pair of oligonucleotides htlec-lib2-tprev
(SEQ ID NO: 42) and htlec-lib2-tpfo (SEQ ID NO: 43), which
encode the appropriately randomized nucleotide sequence (N
denotes a mixture of 25% of each of the nucleotides T, C, G,
and A, and S denotes a mixture of 50% of C and G), with the
oligonucleotides htlec-lib2-rev (SEQ ID NO: 44) and
htlec-lib1/2-fo (SEQ ID NO: 41) as primers using standard
conditions. The ligation mixture was used to transform
so-called electrocompetent E. coli TG-1 cells by
electroporation using standard procedures. After transformation
the E. coli TG-1 cells were plated on 2xTY-agar plates
containing 0.2 mg ampicillin/mL and 2% glucose and incubated
overnight at 30 °C.

The titer of the libraries Phtlec-lb001 and -lb002 was
determined to be 1.4*10* and 3.2*10^9 clones, respectively.
Six clones from each library were grown and phagemid DNA
isolated using a standard miniprep procedure, and the
nucleotide sequence of the loop-region determined (DNA
Technology, Aarhus, Denmark). One clone from each library
failed, for technical reasons, to give reliable nucleotide
sequence, and one clone from Phtlec-lb001 apparently contained
a major deletion. The variation of nucleotide sequences,
compared to Phtlec (SEQ ID NO: 12), of the loop-regions of the
other nine clones (lb001-1, lb001-2, lb001-3, lb001-4, lb002-1,
lb002-2, lb002-3, lb002-4, and lb002-5) is shown in Table 3.

Example 5

Construction of the phage library PhtCTLD-lb003 

All oligonucleotides used in this example were supplied by
DNA Technology (Aarhus, Denmark).

The phage library PhtCTLD-lb003, containing random amino acid
residues corresponding to PhtCTLD (SEQ ID NO: 15) positions 77
to 79 and 81 to 82 (loop 1) and 108 to 109 (loop 4), was
constructed by ligation of 20 µg BglII and MunI restricted
pPhtCTLD phagemid DNA (cf. Example 1) with 10 µg of a BglII and
MunI restricted DNA fragment population encoding the
appropriately randomised loop 1 and 4 regions with or without
two and three random residue insertions in loop 1 and with
three and four random residue insertions in loop 4. The DNA
fragment population was amplified from six so-called assembly
reactions combining each of the three loop 1 DNA fragments with
each of the two loop 4 DNA fragments as templates and the
oligonucleotides TN-lib3-rev (SEQ ID NO: 45) and loop 3-4-5
tagfo (SEQ ID NO: 46) as primers using standard procedures.
Each of the three loop 1 fragments was amplified in a reaction
with either the oligonucleotide loop1b (SEQ ID NO: 47), loop1c
(SEQ ID NO: 48), or loop1d (SEQ ID NO: 49) as template and the
oligonucleotides TN-lib3-rev (SEQ ID NO: 45) and TN-KpnI-fo
(SEQ ID NO: 50) as primers, and each of the two loop 4 DNA
fragments was amplified in a reaction with either the
oligonucleotide loop4b (SEQ ID NO: 51) or loop4c (SEQ ID
NO: 52) as template and the oligonucleotides loop3-4rev (SEQ ID
NO: 53) and loop3-4fo (SEQ ID NO: 54) as primers using standard
procedures. In the oligonucleotide sequences N denotes a
mixture of 25% of each of the nucleotides T, C, G, and A, and S
denotes a mixture of 50% of C and G, encoding the appropriately
randomized nucleotide sequence. The ligation mixture was used
to transform so-called electrocompetent E. coli TG-1 cells by
electroporation using standard procedures. After transformation
the E. coli TG-1 cells were plated on 2xTY-agar plates
containing 0.2 mg ampicillin/mL and 2% glucose and incubated
overnight at 30 °C.

The size of the resulting library, PhtCTLD-lb003, was
determined to be 1.4*10^10 clones. Twenty-four clones from the
library were grown and phages and phagemid DNA isolated. The
nucleotide sequences of the loop-regions were determined (DNA
Technology, Aarhus, Denmark) and binding to a polyclonal
antibody against tetranectin, anti-TN (DAKO A/S, Denmark), was
analysed in an ELISA-type assay using HRP conjugated anti-gene
VIII (Amersham Pharmacia Biotech) as secondary antibody using
standard procedures. Eighteen clones were found to contain
correct loop inserts, one clone contained the wild type loop
region sequence, one a major deletion, two contained two or
more sequences, and two clones contained a frameshift mutation
in the region. Thirteen of the 18 clones with correct loop
inserts, the wild type clone, and one of the mixed isolates
reacted strongly with the polyclonal anti-TN antibody. Three of
the 18 correct clones reacted weakly with the antibody,
whereas two of the correct clones, the deletion mutant, one of
the mixed isolates, and the two frameshift mutants did not show
a signal above background.

Example 6 

Phage selection by biopanning on anti-TN antibody. 

Approximately 10^11 phages from the PhtCTLD-lb003 library
were used for selection in two rounds on the polyclonal anti-TN
antibody by panning in Maxisorb immunotubes (NUNC, Denmark)
using standard procedures. Fifteen clones out of 7*10^7 from
the plating after the second selection round were grown,
phagemid DNA was isolated and the nucleotide sequence
determined. All 15 clones were found to encode correct and
different loop sequences.
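
The panning input can be related to the library size reported in
Example 5 (1.4*10^10 clones). A minimal sketch follows, assuming
that all clones are equally represented in the phage preparation
and that sampling follows a Poisson distribution; both are
simplifying assumptions of this sketch and are not stated in the
example.

```python
import math

library_size = 1.4e10   # clones in PhtCTLD-lb003 (Example 5)
phage_input = 1e11      # approximate number of phages used for panning

# Expected number of phage particles per library clone in the input.
copies_per_clone = phage_input / library_size

# Poisson probability that a given clone is absent from the input.
p_missing = math.exp(-copies_per_clone)

print(f"copies per clone: {copies_per_clone:.1f}")
print(f"P(clone absent):  {p_missing:.1e}")
```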

Example 7

Model selection of CTLD-phages on plasminogen. 
I: elution by trypsin digestion after panning. 

In order to demonstrate that tetranectin derived CTLD bearing
phages can be selected from a population of phages, mixtures of
PhtCTLD phages isolated from an E. coli TG1 culture transformed
with the phagemid pPhtCTLD (cf. Example 1) after infection with
M13K07 helper phage and phages isolated from a culture
transformed with the phagemid pPhtCPB after infection with
M13K07 helper phage, at ratios of 1:10 and 1:10^5,
respectively, were used in a selection experiment using panning
in 96-well Maxisorb micro-titerplates (NUNC, Denmark) and with
human plasminogen as antigen. The pPhtCPB phagemid was
constructed by ligation of the double stranded oligonucleotide
(SEQ ID NO: 55) with the appropriate restriction enzyme
overhang sequences into KpnI and MunI restricted pPhtCTLD
phagemid DNA. The PhtCPB phages derived upon infection with the
helper phages display only the wild type M13 gene III protein
because of the translation termination codons introduced into
the CTLD coding region of the resulting pPhtCPB phagemid (SEQ
ID NO: 56).

The selection experiments were performed in 96 well micro
titer plates using standard procedures. Briefly, in each well
3 µg of human plasminogen in 100 µL PBS (PBS: 0.2 g KCl, 0.2 g
KH2PO4, 8 g NaCl, 1.44 g Na2HPO4·2H2O, water to 1 L, and
adjusted to pH 7.4 with NaOH) or 100 µL PBS (for analysis of
non specific binding) was used for overnight coating at 4 °C
and at 37 °C for one hour. After washing once with PBS, wells
were blocked with 400 µL PBS and 3% non fat dried milk for one
hour at 37 °C. After blocking, wells were washed once in PBS
and 0.1% Tween 20 and three times with PBS before the addition
of phages suspended in 100 µL PBS, 3% non fat dried milk. The
phages were allowed to bind at 37 °C for one hour before
washing three times with PBS, Tween 20 and three times with
PBS. Bound phages were eluted from each well by trypsin
digestion in 100 µL (1 mg/mL trypsin in PBS) for 30 min at room
temperature, and used for infection of exponentially growing
E. coli TG1 cells before plating and titration on 2xTY agar
plates containing 2% glucose and 0.1 mg/mL ampicillin.
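
The PBS recipe above is given in grams per litre, whereas
Example 2 states the PBS composition in millimolar. A minimal
sketch converting one into the other is shown below. The molar
masses are rounded standard values supplied for this sketch (an
assumption, not taken from the text), and the phosphate salt is
taken to be the dihydrate as written in the recipe.

```python
# Molar masses in g/mol (rounded standard values; assumption of this sketch).
MOLAR_MASS = {
    "KCl": 74.55,
    "KH2PO4": 136.09,
    "NaCl": 58.44,
    "Na2HPO4.2H2O": 177.99,
}

# Recipe from the text: grams per litre of PBS.
GRAMS_PER_LITRE = {
    "KCl": 0.2,
    "KH2PO4": 0.2,
    "NaCl": 8.0,
    "Na2HPO4.2H2O": 1.44,
}


def millimolar(grams_per_litre, molar_mass):
    """Convert a g/L amount into a millimolar concentration."""
    return 1000.0 * grams_per_litre / molar_mass


pbs_mM = {salt: millimolar(g, MOLAR_MASS[salt])
          for salt, g in GRAMS_PER_LITRE.items()}

for salt, conc in pbs_mM.items():
    print(f"{salt}: {conc:.2f} mM")
```

The values obtained this way agree closely with the millimolar
composition quoted in Example 2, which suggests that both
descriptions refer to the same buffer.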

Initially (round 1), 10^12 PhtCTLD phages (A series), a
mixture of 10^10 PhtCTLD phages and 10^11 PhtCPB phages (B
series), or a mixture of 10^6 PhtCTLD and 10^11 PhtCPB phages
(C series) were used. In the following round (round 2) 10^11
phages of the output from each series were used. Results from
the two rounds of selection are summarised in Table 4.

Table 4: Selection of mixtures of PhtCTLD and PhtCPB by
panning and elution with trypsin.

                    Plasminogen         Blank
                    (*10^5 colonies)    (*10^5 colonies)
Round 1    A        113.0               19.50
           B        1.8                 1.10
           C        0.1                 0.30
Round 2    A        49                  0.10
           B        5.2                 0.20
           C        0.3                 0.04

Phagemid DNA from 12 colonies from the second round of
plating together with 5 colonies from a plating of the initial
phage mixtures was isolated and the nucleotide sequence of the
CTLD region determined. From the initial 1/10 mixture (B
series) of PhtCTLD/PhtCPB one out of five was identified as the
CTLD sequence. From the initial 1/10^5 mixture (C series) all
five sequences were derived from the pPhtCPB phagemid. After
round 2 nine of the twelve sequences analysed from the B series
and all twelve sequences from the C series were derived from
the pPhtCTLD phagemid.
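
The sequencing counts above can be turned into an approximate
enrichment factor by comparing the odds of PhtCTLD to PhtCPB
before and after selection. This is a minimal sketch; the
function name odds_enrichment is hypothetical, the nominal input
ratios are used rather than the five-colony samples, and with
only twelve sequenced clones the result is a rough indication at
best. When no PhtCPB clone is found among the sequenced colonies
the odds are undefined and only a lower bound could be given, so
the function returns None in that case.

```python
from fractions import Fraction


def odds_enrichment(input_odds, target_clones, total_clones):
    """Ratio of output odds (target : non-target among sequenced clones)
    to input odds. Returns None if no non-target clone was observed."""
    non_target = total_clones - target_clones
    if non_target == 0:
        return None
    output_odds = Fraction(target_clones, non_target)
    return output_odds / input_odds


# B series: input ratio 1:10, nine of twelve clones were PhtCTLD.
b_series = odds_enrichment(Fraction(1, 10), 9, 12)
# C series: input ratio 1:10^5, all twelve clones were PhtCTLD.
c_series = odds_enrichment(Fraction(1, 10**5), 12, 12)

print("B series enrichment:", b_series)
print("C series enrichment:", c_series)
```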

Example 8

Model selection of CTLD-phages on plasminogen. 
II: elution by 0.1 M triethylamine after panning. 

In order to demonstrate that tetranectin derived CTLD-bearing
phages can be selected from a population of phages, mixtures of
PhtCTLD phages isolated from an E. coli TG1 culture transformed
with the phagemid pPhtCTLD (cf. Example 1) after infection with
M13K07 helper phage and phages isolated from a culture
transformed with the phagemid pPhtCPB (cf. Example 7) after
infection with M13K07 helper phage, at ratios of 1:10^2 and
1:10^6, respectively, were used in a selection experiment using
panning in 96-well Maxisorb micro-titerplates (NUNC, Denmark)
and with human plasminogen as antigen using standard
procedures.

Briefly, in each well 3 µg of human plasminogen in 100 µL PBS
(PBS: 0.2 g KCl, 0.2 g KH2PO4, 8 g NaCl, 1.44 g Na2HPO4·2H2O,
water to 1 L, and adjusted to pH 7.4 with NaOH) or 100 µL PBS
(for analysis of non specific binding) was used for overnight
coating at 4 °C and at 37 °C for one hour. After washing once
with PBS, wells were blocked with 400 µL PBS and 3% non fat
dried milk for one hour at 37 °C. After blocking, wells were
washed once in PBS and 0.1% Tween 20 and three times with PBS
before the addition of phages suspended in 100 µL PBS, 3% non
fat dried milk. The phages were allowed to bind at 37 °C for
one hour before washing 15 times with PBS, Tween 20, and 15
times with PBS. Bound phages were eluted from each well by
100 µL 0.1 M triethylamine for 10 min at room temperature and,
upon neutralisation with 0.5 vol. 1 M Tris-HCl pH 7.4, used for
infection of exponentially growing E. coli TG1 cells before
plating and titration on 2xTY agar plates containing 2% glucose
and 0.1 mg/mL ampicillin.

Initially (round 1) 10^12 PhtCTLD phages (A series), a
mixture of 10^9 PhtCTLD phages and 10^11 PhtCPB phages (B
series), or a mixture of 10^5 PhtCTLD and 10^11 PhtCPB phages
(C series) were used. In the following round (round 2) 10^11
phages of the output from each series were used. Results from
the two rounds of selection are summarised in Table 5.

Table 5: Selection of mixtures of PhtCTLD and PhtCPB by
panning and elution with triethylamine.

                    Plasminogen         Blank
                    (*10^4 colonies)    (*10^4 colonies)
Round 1    A        18                  0.02
           B        0.5                 0.00
           C        0.25                0.02
Round 2    A        n.d.                n.d.
           B        5.0                 0.00
           C        1.8                 0.02
Round 3    A        n.d.                n.d.
           B        H                   0.00
           C        6.5                 0.02

n.d. = not determined

Phage mixtures from the A and the B series from the second
round of selection were grown using a standard procedure, and
analysed for binding to plasminogen in an ELISA-type assay.
Briefly, in each well 3 µg of plasminogen in 100 µL PBS (PBS:
0.2 g KCl, 0.2 g KH2PO4, 8 g NaCl, 1.44 g Na2HPO4·2H2O, water
to 1 L, and adjusted to pH 7.4 with NaOH) or 100 µL PBS (for
analysis of non specific binding) was used for overnight
coating at 4 °C and at 37 °C for one hour. After washing once
with PBS, wells were blocked with 400 µL PBS and 3% non fat
dried milk for one hour at 37 °C. After blocking, wells were
washed once in PBS and 0.1% Tween 20 and three times with PBS
before the addition of phages suspended in 100 µL PBS, 3% non
fat dried milk. The phage mixtures were allowed to bind at
37 °C for one hour before washing three times with PBS, Tween
20, and three times with PBS. After washing, 50 µL of a 1:5000
dilution of a HRP-conjugated anti-gene VIII antibody (Amersham
Pharmacia Biotech) in PBS, 3% non fat dried milk was added to
each well and incubated at 37 °C for one hour. After binding of
the "secondary" antibody, wells were washed three times with
PBS, Tween 20, and three times with PBS before the addition of
50 µL of TMB substrate (DAKO-TMB One-Step Substrate System,
code: S1600, DAKO, Denmark). Reaction was allowed to proceed
for 20 min before quenching with 0.5 vol. 0.5 M H2SO4, and
analysis. The result of the ELISA analysis confirmed specific
binding to plasminogen of phages in both series (Fig. 28).

Example 9 

Selection of phages from the library Phtlec-lb002 binding
to hen egg white lysozyme.

1.2*10 1C phages, approximately 250 times the size of the 
original library, derived from the Phtlec-lb002 library
(cf. Example 4) were used in an experimental procedure for the
selection of phages binding to hen egg white lysozyme involving
sequential rounds of panning using standard procedures.

Briefly, 30 µg of hen egg white lysozyme in 1 mL PBS (PBS:
0.2 g KCl, 0.2 g KH2PO4, 8 g NaCl, 1.44 g Na2HPO4·2H2O, water
to 1 L, and adjusted to pH 7.4 with NaOH) or 1 mL PBS (for
analysis of non specific binding) was used for overnight
coating of Maxisorb immunotubes (NUNC, Denmark) at 4 °C and at
37 °C for one hour. After washing once with PBS, tubes were
filled and blocked with PBS and 3% non fat dried milk for one
hour at 37 °C. After blocking, tubes were washed once in PBS,
0.1% Tween 20 and three times with PBS before the addition of
phages suspended in 1 mL PBS, 3% non fat dried milk. The phages
were allowed to bind at 37 °C for one hour before washing six
times with PBS, Tween 20 and six times with PBS. Bound phages
were eluted from each tube by 1 mL 0.1 M triethylamine for
10 min at room temperature and, upon neutralisation with 1 M
Tris-HCl pH 7.4, used for infection of exponentially growing
E. coli TG1 cells before plating and titration on 2xTY agar
plates containing 2% glucose and 0.1 mg/mL ampicillin. In the
subsequent rounds of selection approximately 10^12 phages
derived from a culture grown from the colonies plated after
infection with the phages eluted from the lysozyme coated tube
were used in the panning procedure. However, the stringency in
binding was increased by increasing the number of washing steps
after phage panning from six to ten.

The results of the selection procedure are shown in Table 7.

Table 7: Selection by panning of lysozyme-binding phages
from the Phtlec-lb002 library.

           Lysozyme    Blank       Ratio
Round 1    2.4*10^4    n.a.        n.a.
Round 2    3.5*10^3    4.0*10^2    9
Round 3    3.2*10^5    2.5*10^2    1.3*10^3

n.a. = not applicable
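
The Ratio column of Table 7 is the count recovered from the
lysozyme-coated tube divided by the count recovered from the
blank tube. A minimal sketch of that arithmetic follows; the
function and variable names are illustrative and are not part
of the procedure described.

```python
def enrichment_ratio(target_count, blank_count):
    """Return target/blank, or None when no blank was run (n.a.)."""
    if blank_count is None:
        return None
    return target_count / blank_count


# (lysozyme, blank) counts as listed in Table 7.
TABLE_7 = {
    "Round 1": (2.4e4, None),
    "Round 2": (3.5e3, 4.0e2),
    "Round 3": (3.2e5, 2.5e2),
}

RATIOS = {rnd: enrichment_ratio(*counts) for rnd, counts in TABLE_7.items()}

if __name__ == "__main__":
    for rnd, ratio in RATIOS.items():
        print(rnd, "n.a." if ratio is None else f"{ratio:.3g}")
```

The unrounded values are 8.75 for round 2 and 1280 for round
3, in agreement with the rounded entries 9 and 1.3*10^3.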



Phages were grown from twelve clones isolated from the
third round of selection in order to analyse the specificity
of binding using a standard procedure, and analysed for
binding to hen egg white lysozyme and human
β2-microglobulin in an ELISA-type assay. Briefly, in each
well 3 µg of hen egg white lysozyme in 100 µL PBS (PBS:
0.2 g KCl, 0.2 g KH2PO4, 8 g NaCl, 1.44 g Na2HPO4·2H2O,
water to 1 L, adjusted to pH 7.4 with NaOH), or 3 µg of
human β2-microglobulin, or 100 µL PBS (for analysis of
non-specific binding) was used for overnight coating at
4 °C and at 37 °C for one hour. After washing once with
PBS, wells were blocked with 400 µL PBS and 3% non-fat
dried milk for one hour at 37 °C. After blocking, wells
were washed once in PBS and 0.1% Tween 20 and three times
with PBS before the addition of phages suspended in 100 µL
PBS, 3% non-fat dried milk. The phages were allowed to bind
at 37 °C for one hour before washing three times with PBS,
Tween 20 and three times with PBS. After washing, 50 µL of
a 1 to 5000 dilution of a HRP-conjugated anti-gene VIII
antibody (Amersham Pharmacia Biotech) in PBS, 3% non-fat
dried milk was added to each well and incubated at 37 °C
for one hour. After binding of the "secondary" antibody,
wells were washed three times with PBS, Tween 20 and three
times with PBS before the addition of 50 µL of TMB
substrate (DAKO-TMB One-Step Substrate System, code: S1600,
DAKO, Denmark). The reaction was allowed to proceed for
20 min before quenching with 0.5 M H2SO4.

Results showing relatively weak but specific binding to ly- 
sozyme are summarised in Fig. 29. 



Example 10

Construction of the rat mannose-binding protein CTLD
(r-MBP) derived phagemid (pPrMBP) and human lung surfactant
protein D CTLD (h-SP-D) derived phagemid (pPhSP-D)



The phagemid, pPrMBP, is constructed by ligation of the
Sfi I and Not I restricted DNA fragment amplified from
cDNA isolated from rat liver (Drickamer, K., et al., J.
Biol. Chem. 1987, 262(6):2582-2589) (with the
oligonucleotide primers SfiMBP
5'-CGGCTGAGCGGCCCAGCCGGCCATGGCCGAGCCAAACAAGTTGCATGCCTTCTCC-3'
[SEQ ID NO: 62] and NotMBP
5'-GCACTCCTGCGGCCGCGGCTGGGAACTCGCAGAC-3' [SEQ ID NO: 63])
into a Sfi I and Not I precut vector, pCANTAB 5E supplied
by Amersham Pharmacia Biotech (code no. 27-9401-01), using
standard procedures. An outline of the resulting pPrMBP is
shown in Fig. 31 and the nucleotide sequence of PrMBP is
given as SEQ ID NO: 58. The amino acid sequence encoded by
the PrMBP insert is shown in Fig. 30 (SEQ ID NO: 59).

The phagemid, pPhSP-D, is constructed by ligation of the
Sfi I and Not I restricted DNA fragment amplified from
cDNA isolated from human lung (Lu, J., et al., Biochem J.
1992 Jun 15; 284:795-802) (with the oligonucleotide primers
SfiSP-D
5'-CGGCTGAGCGGCCCAGCCGGCCATGGCCGAGCCAAAGAAAGTTGAGCTCTTCCC-3'
[SEQ ID NO: 64] and NotSP-D
5'-GCACTCCTGCGGCCGCGAACTCGCAGACCACAAGAC-3' [SEQ ID NO: 65])
into a Sfi I and Not I precut vector, pCANTAB 5E supplied
by Amersham Pharmacia Biotech (code no. 27-9401-01), using
standard procedures. An outline of the resulting pPhSP-D is
shown in Fig. 33 and the nucleotide sequence of PhSP-D is
given as SEQ ID NO: 60. The amino acid sequence encoded by
the PhSP-D insert is shown in Fig. 32 (SEQ ID NO: 61).
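
By way of illustration, the four primers given in this
example can be checked for the presence of the cloning
sites. The sketch below assumes the commonly cited
recognition sequences GGCCNNNNNGGCC for Sfi I and GCGGCCGC
for Not I; these are not stated in this description, and
the helper names are hypothetical.

```python
import re

# Recognition sequences (assumed standard enzyme specificities;
# not stated in the description). Lookahead allows overlapping hits.
SFI_I = re.compile(r"(?=(GGCC[ACGT]{5}GGCC))")
NOT_I = re.compile(r"(?=(GCGGCCGC))")

# Primer sequences as given in Example 10 (5' to 3').
PRIMERS = {
    "SfiMBP": "CGGCTGAGCGGCCCAGCCGGCCATGGCCGAGCCAAACAAGTTGCATGCCTTCTCC",
    "NotMBP": "GCACTCCTGCGGCCGCGGCTGGGAACTCGCAGAC",
    "SfiSP-D": "CGGCTGAGCGGCCCAGCCGGCCATGGCCGAGCCAAAGAAAGTTGAGCTCTTCCC",
    "NotSP-D": "GCACTCCTGCGGCCGCGAACTCGCAGACCACAAGAC",
}


def site_positions(pattern, sequence):
    """Return 0-based start positions of every match of pattern."""
    return [match.start() for match in pattern.finditer(sequence)]


SITES = {
    name: {
        "SfiI": site_positions(SFI_I, seq),
        "NotI": site_positions(NOT_I, seq),
    }
    for name, seq in PRIMERS.items()
}

if __name__ == "__main__":
    for name, found in SITES.items():
        print(name, found)
```

Each forward primer is expected to carry one Sfi I site and
each reverse primer one Not I site, consistent with cloning
into the Sfi I and Not I precut vector.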

Example 11

Construction of the phage library PrMBP-lb001

The phage library PrMBP-lb001, containing random amino acid
residues corresponding to PrMBP CTLD (SEQ ID NO: 59)
positions 71 to 73 or 70 to 76 (loop 1) and 97 to 101 or
100 to 101 (loop 4), is constructed by ligation of 20 µg
SfiI and NotI restricted pPrMBP phagemid DNA (cf. Example
10) with 10 µg of a SfiI and NotI restricted DNA fragment
population encoding the appropriately randomised loop 1 and
4 regions. The DNA fragment population is amplified from
nine assembly reactions combining each of the three loop 1
DNA fragments with each of the three loop 4 DNA fragments
as templates and the oligonucleotides Sfi-tag
5'-CGGCTGAGCGGCCCAGC-3' (SEQ ID NO: 74) and Not-tag
5'-GCACTCCTGCGGCCGCG-3' (SEQ ID NO: 75) as primers using
standard procedures. Each of the three loop 1 fragments is
amplified in a primary PCR reaction with pPrMBP phagemid
DNA (cf. Example 10) as template and the oligonucleotides
MBPloop1a fo (SEQ ID NO: 66), MBPloop1b fo (SEQ ID NO: 67)
or MBPloop1c fo (SEQ ID NO: 68) and SfiMBP (SEQ ID NO: 62)
as primers, and further amplified in a secondary PCR
reaction using Sfi-tag (SEQ ID NO: 74) and MBPloop1-tag fo
(SEQ ID NO: 69). Each of the three loop 4 DNA fragments is
amplified in a primary PCR reaction with pPrMBP phagemid
DNA (cf. Example 10) as template and the oligonucleotides
MBPloop4a rev (SEQ ID NO: 71), MBPloop4b rev (SEQ ID
NO: 72) or MBPloop4c rev (SEQ ID NO: 73) and NotMBP (SEQ ID
NO: 63) as primers using standard procedures, and further
amplified in a secondary PCR reaction using MBPloop4-tag
rev (SEQ ID NO: 70) and Not-tag (SEQ ID NO: 75). In the
oligonucleotide sequences N denotes a mixture of 25% of
each of the nucleotides T, C, G, and A, respectively, and S
denotes a mixture of 50% of C and G, encoding the
appropriately randomised nucleotide sequence. The ligation
mixture is used to transform so-called electrocompetent E.
coli TG-1 cells by electroporation using standard
procedures. After transformation the E. coli TG-1 cells are
plated on 2xTY-agar plates containing 0.2 mg ampicillin/mL
and 2% glucose and incubated overnight at 30 °C.
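
To illustrate the N and S nucleotide mixtures defined
above, the following sketch enumerates the codons covered
by a degenerate NNS triplet and translates them with the
standard genetic code. It is an assumption made for this
illustration that each randomised position is encoded by an
NNS triplet; the oligonucleotide sequences themselves are
not reproduced in this section, and the names used are
illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
# ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Nucleotide mixtures as defined in the text.
MIXTURES = {"N": "TCGA", "S": "CG"}


def expand(degenerate_codon):
    """List every concrete codon covered by a degenerate triplet."""
    pools = [MIXTURES.get(symbol, symbol) for symbol in degenerate_codon]
    return ["".join(bases) for bases in product(*pools)]


NNS_CODONS = expand("NNS")
NNS_STOPS = [c for c in NNS_CODONS if CODON_TABLE[c] == "*"]
NNS_AMINO_ACIDS = {CODON_TABLE[c] for c in NNS_CODONS} - {"*"}

if __name__ == "__main__":
    print(len(NNS_CODONS), "codons;", len(NNS_AMINO_ACIDS), "amino acids")
    print("stop codons:", NNS_STOPS)
```

Under this assumption an NNS triplet covers 32 codons that
together encode all 20 amino acids and include a single
stop codon (TAG).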

Example 12 

Construction of the phage library PhSP-D-lb001

The phage library PhSP-D-lb001, containing random amino
acid residues corresponding to PhSP-D CTLD insert (SEQ ID
NO: 61) positions 74 to 76 or 73 to 79 (loop 1) and 100 to
104 or 103 to 104 (loop 4), is constructed by ligation of
20 µg SfiI and NotI restricted pPhSP-D phagemid DNA (cf.
Example 10) with 10 µg of a SfiI and NotI restricted DNA
fragment population encoding the appropriately randomised
loop 1 and 4 regions. The DNA fragment population is
amplified from nine assembly reactions combining each of
the three loop 1 DNA fragments with each of the three loop
4 DNA fragments as templates and the oligonucleotides
Sfi-tag 5'-CGGCTGAGCGGCCCAGC-3' (SEQ ID NO: 74) and Not-tag
5'-GCACTCCTGCGGCCGCG-3' (SEQ ID NO: 75) as primers using
standard procedures. Each of the three loop 1 fragments is
amplified in a primary PCR reaction with pPhSP-D phagemid
DNA (cf. Example 10) as template and the oligonucleotides
Sp-dloop1a fo (SEQ ID NO: 76), Sp-dloop1b fo (SEQ ID
NO: 77) or Sp-dloop1c fo (SEQ ID NO: 78) and SfiSP-D (SEQ
ID NO: 64) as primers, and further amplified in a PCR
reaction using Sfi-tag (SEQ ID NO: 74) and Sp-dloop1-tag fo
(SEQ ID NO: 79) as primers. Each of the three loop 4 DNA
fragments is amplified in a primary PCR reaction with
pPhSP-D phagemid DNA (cf. Example 10) as template and the
oligonucleotides Sp-dloop4a rev (SEQ ID NO: 81), Sp-dloop4b
rev (SEQ ID NO: 82) or Sp-dloop4c rev (SEQ ID NO: 83) and
NotSP-D (SEQ ID NO: 65) as primers using standard
procedures, and further amplified in a PCR reaction using
Sp-dloop4-tag rev (SEQ ID NO: 80) and Not-tag (SEQ ID
NO: 75) as primers. In the oligonucleotide sequences N
denotes a mixture of 25% of each of the nucleotides T, C,
G, and A, respectively, and S denotes a mixture of 50% of C
and G, encoding the appropriately randomised nucleotide
sequence. The ligation mixture is used to transform
so-called electrocompetent E. coli TG-1 cells by
electroporation using standard procedures. After
transformation the E. coli TG-1 cells are plated on
2xTY-agar plates containing 0.2 mg ampicillin/mL and 2%
glucose and incubated overnight at 30 °C.
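
The nine assembly reactions referred to above arise from
pairing each of the three loop 1 fragments with each of the
three loop 4 fragments. A minimal sketch of that pairing is
given below; the fragments are labelled here by the
randomising oligonucleotide used to generate them, which is
a labelling convention adopted only for this illustration.

```python
from itertools import product

# Fragments labelled by the randomising oligonucleotide named in the text.
LOOP1_FRAGMENTS = ["Sp-dloop1a fo", "Sp-dloop1b fo", "Sp-dloop1c fo"]
LOOP4_FRAGMENTS = ["Sp-dloop4a rev", "Sp-dloop4b rev", "Sp-dloop4c rev"]

# Every loop 1 fragment is combined with every loop 4 fragment.
ASSEMBLY_REACTIONS = list(product(LOOP1_FRAGMENTS, LOOP4_FRAGMENTS))

if __name__ == "__main__":
    for number, (loop1, loop4) in enumerate(ASSEMBLY_REACTIONS, start=1):
        print(f"assembly {number}: {loop1} + {loop4}")
```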



Example 13 

Construction of the phage library PhtCTLD-lb004

All oligonucleotides used in this example were supplied by
DNA Technology (Aarhus, Denmark).

The phage library PhtCTLD-lb004, containing random amino
acid residues corresponding to PhtCTLD (SEQ ID NO: 15)
positions 97 to 102 or 98 to 101 (loop 3) and positions 116
to 122 or 118 to 120 (loop 5), was constructed by ligation
of 20 µg KpnI and MunI restricted pPhtCTLD phagemid DNA
(cf. Example 1) with 10 µg of a KpnI and MunI restricted
DNA fragment population encoding the randomised loop 3 and
5 regions. The DNA fragment population was amplified from
nine primary PCR reactions combining each of the three loop
3 DNA fragments with each of the three loop 5 DNA
fragments. The fragments were amplified with either of the
oligonucleotides loop3a (SEQ ID NO: 84), loop3b (SEQ ID
NO: 85), or loop3c (SEQ ID NO: 86) as template and loop5a
(SEQ ID NO: 87), loop5b (SEQ ID NO: 88) or loop5c (SEQ ID
NO: 89) and loop3-4rev (SEQ ID NO: 91) as primers. The DNA
fragments were further amplified in PCR reactions, using
the primary PCR product as template and the
oligonucleotides loop3-4rev (SEQ ID NO: 91) and
loop3-4-5tag fo (SEQ ID NO: 90) as primers. All PCR
reactions were performed using standard procedures.

In the oligonucleotide sequences N denotes a mixture of 25%
of each of the nucleotides T, C, G, and A, respectively, and
S denotes a mixture of 50% of C and G, encoding the
appropriately randomised nucleotide sequence. The ligation
mixture was used to transform so-called electrocompetent E.
coli TG-1 cells by electroporation using standard
procedures. After transformation the E. coli TG-1 cells
were plated on 2xTY-agar plates containing 0.2 mg
ampicillin/mL and 2% glucose and incubated overnight at
30 °C.

The size of the resulting library, PhtCTLD-lb004, was
determined to be 7*10^9 clones. Sixteen clones from the
library were picked and phagemid DNA isolated. The
nucleotide sequences of the loop regions were determined
(DNA Technology, Aarhus, Denmark). Thirteen clones were
found to contain correct loop inserts and three clones
contained a frameshift mutation in the region.
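
The sequencing result above can be used for a rough
estimate of the number of library members carrying correct
loop inserts. The sketch below simply scales the library
size by the observed fraction of correct clones; with only
sixteen clones sampled this is an indicative figure rather
than a measured value, and the names are illustrative.

```python
LIBRARY_SIZE = 7e9       # clones, as determined for PhtCTLD-lb004
CLONES_SEQUENCED = 16
CLONES_CORRECT = 13
CLONES_FRAMESHIFT = 3


def correct_fraction(correct, total):
    """Fraction of sequenced clones with correct loop inserts."""
    if total <= 0:
        raise ValueError("total must be positive")
    return correct / total


FRACTION_CORRECT = correct_fraction(CLONES_CORRECT, CLONES_SEQUENCED)
ESTIMATED_CORRECT_CLONES = LIBRARY_SIZE * FRACTION_CORRECT

if __name__ == "__main__":
    print(f"fraction correct: {FRACTION_CORRECT:.2%}")
    print(f"estimated correct clones: {ESTIMATED_CORRECT_CLONES:.2e}")
```

On this basis about 81% of the clones, corresponding to
roughly 5.7*10^9 library members, would be expected to
carry correct loop inserts.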


Example 14 

Selection of Phtlec-phages and PhtCTLD-phages binding to 
the blood group A sugar moiety immobilised on human serum 
albumin 

Phages grown from glycerol stocks of the libraries
Phtlec-lb001 and Phtlec-lb002 (cf. Example 4) and phages
grown from a glycerol stock of the library PhtCTLD-lb003
(cf. Example 5), using a standard procedure, were used in
an experiment designed for the selection of Phtlec- and
PhtCTLD-derived phages with specific affinity to the blood
group A sugar moiety immobilised on human serum albumin,
A-HA, by panning in 96-well Maxisorb micro-titerplates
(NUNC, Denmark) using standard procedures.

Initially, the phage supernatants were precipitated with
0.3 vol. of a solution of 20% polyethylene glycol 6000
(PEG) and 2.5 M NaCl, and the pellets re-suspended in
TE-buffer (10 mM Tris-HCl pH 8, 1 mM EDTA). After titration
on E. coli TG-1 cells, phages derived from Phtlec-lb001 and
-lb002 were mixed (#1) in a 1:1 ratio and adjusted to
5*10^12 pfu/mL in 2xTY medium, and phages grown from the
PhtCTLD-lb003 library (#4) were adjusted to 2.5*10^12
pfu/mL in 2xTY medium.

One microgram of the "antigen", human blood group A
trisaccharide immobilised on human serum albumin, A-HA
(Glycorex AB, Lund, Sweden), in 100 µL PBS (PBS: 0.2 g KCl,
0.2 g KH2PO4, 8 g NaCl, 1.44 g Na2HPO4·2H2O, water to 1 L,
adjusted to pH 7.4 with NaOH), in each of three wells, was
coated overnight at 4 °C and at room temperature for one
hour, before the first round of panning. After washing once
with PBS, wells were blocked with 300 µL PBS and 3% non-fat
dried milk for one hour at room temperature. After
blocking, wells were washed once in PBS and 0.1% Tween 20
and three times with PBS before the addition of a mixture
of 50 µL of the phage suspension and 50 µL PBS, 6% non-fat
dried milk. The phages were allowed to bind at room
temperature for two hours before washing eight times with
PBS, Tween 20, and eight times with PBS. Bound phages were
eluted from each well by trypsin digestion in 100 µL
(1 mg/mL trypsin in PBS) for 30 min at room temperature,
and used for infection of exponentially growing E. coli TG1
cells before plating and titration on 2xTY agar plates
containing 2% glucose and 0.1 mg/mL ampicillin.

In the second round of selection, 150 µL of crude phage
supernatant, grown from the first round output colonies,
was mixed with 150 µL PBS, 6% non-fat dried milk, and used
for panning, distributing 100 µL of the mixture in each of
three A-HA coated wells, as previously described.
Stringency in binding was increased by increasing the
number of washing steps from 16 to 32. As a control, 300 µL
of phage mixture was also used for panning in three wells
which had received no antigen.

In the third round of selection, 150 µL of crude phage
supernatant, grown from the second round output colonies,
was mixed with 150 µL PBS, 6% non-fat dried milk, and used
for panning, distributing 100 µL of the mixture in each of
three A-HA coated wells, as previously described. The
number of washing steps was again 32. As a control, 300 µL
of phage mixture was also used for panning in three wells
which had received no antigen.

The results from the selection procedure are summarised in
Table 8.

Table 8. Selection of Phtlec phages (#1) and PhtCTLD
phages (#4) binding to A-HA by panning and elution with
trypsin digestion.

               A-HA        Blank       Ratio
Round 1   #1   0.8*10^3    n.a.        n.a.
          #4   1.1*10^3    n.a.        n.a.
Round 2   #1   1.0*10^3    0.5*10^2    20
          #4   1.3*10^3    0.5*10^2    26
Round 3   #1   8.0*10^4    0.5*10^2    1600
          #4   9.0*10^5    0.5*10^2    18000

n.a. = not applicable.



48 clones from each of the #1 and #4 series were picked and
grown in a 96-well microtiter tray and phages produced by
infection with M13K07 helper phage using a standard
procedure. Phages from the 96 phage supernatants were
analysed for binding to the A-HA antigen and for
non-specific binding to hen egg white lysozyme using an
ELISA-type assay. Briefly, in each well 1 µg of A-HA in
100 µL PBS (PBS: 0.2 g KCl, 0.2 g KH2PO4, 8 g NaCl, 1.44 g
Na2HPO4·2H2O, water to 1 L, adjusted to pH 7.4 with NaOH)
or 1 µg of hen egg white lysozyme in 100 µL PBS (for
analysis of non-specific binding) was used for overnight
coating at 4 °C and at room temperature for one hour. After
washing once with PBS, wells were blocked with 300 µL PBS
and 3% non-fat dried milk for one hour at room temperature.
After blocking, wells were washed once in PBS and 0.1%
Tween 20 and three times with PBS before the addition of
50 µL phage supernatant in 50 µL PBS, 6% non-fat dried
milk. The phage mixtures were allowed to bind at room
temperature for two hours before washing three times with
PBS, Tween 20, and three times with PBS. After washing,
50 µL of a 1:5000 dilution of a HRP-conjugated anti-gene
VIII antibody (Amersham Pharmacia Biotech) in PBS, 3%
non-fat dried milk, was added to each well and incubated at
room temperature for one hour. After binding of the
"secondary" antibody, wells were washed three times with
PBS, Tween 20, and three times with PBS before the addition
of 50 µL of TMB substrate (DAKO-TMB One-Step Substrate
System, DAKO, Denmark). The reaction was allowed to proceed
for 20 min before quenching with 0.5 M H2SO4, and analysis.
The result of the ELISA analysis showed "hits" in terms of
specific binding to A-HA of phages in both series (Fig. 34
and 35), as judged by a ratio of the signal on A-HA to the
signal on lysozyme at or above 1.5, and with a signal above
background.
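
The two hit criteria stated above can be expressed as a
simple test applied to each clone. The following is a
sketch only: the absorbance readings, the background value
and the clone names are hypothetical, and the handling of a
zero control signal is a convention chosen for this
illustration rather than part of the procedure described.

```python
RATIO_THRESHOLD = 1.5  # A-HA signal / lysozyme signal, from the text


def is_hit(signal_target, signal_control, background):
    """Apply both criteria: signal above background and ratio >= 1.5."""
    if signal_target <= background:
        return False
    if signal_control <= 0:
        # Assumed convention: no measurable control signal passes the
        # ratio criterion.
        return True
    return signal_target / signal_control >= RATIO_THRESHOLD


# Hypothetical readings per clone: (signal on A-HA, signal on lysozyme).
EXAMPLE_WELLS = {
    "clone_a": (0.60, 0.20),  # ratio 3.0, above background
    "clone_b": (0.30, 0.25),  # ratio 1.2, fails the ratio criterion
    "clone_c": (0.04, 0.01),  # ratio 4.0, but not above background
}
HYPOTHETICAL_BACKGROUND = 0.05

HITS = sorted(
    name
    for name, (target, control) in EXAMPLE_WELLS.items()
    if is_hit(target, control, HYPOTHETICAL_BACKGROUND)
)

if __name__ == "__main__":
    print("hits:", HITS)
```

With these hypothetical readings only clone_a satisfies
both criteria.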

From the #1 series 13 hits were identified, and 28 hits
were identified from the #4 series.

CLAIMS

1. A protein having the scaffold structure of C-type
lectin-like domains (CTLD), said protein comprising a
variant of a model CTLD wherein the α-helices and β-strands
and connecting segments are conserved to such a degree that
the scaffold structure of the CTLD is substantially
maintained, while the loop region is altered by amino acid
substitution, deletion, insertion or any combination
thereof, with the proviso that said protein is not any of
the known CTLD loop derivatives of C-type lectin-like
proteins or C-type lectins listed in Table 2 in the
description.

2. A protein according to claim 1 wherein the model CTLD is
defined by having a 3D structure that conforms to the
secondary-structure arrangement illustrated in Fig. 1
characterized by the following main secondary structure
elements:

five β-strands and two α-helices sequentially appearing
in the order β1, α1, α2, β2, β3, β4, and β5, the
β-strands being arranged in two anti-parallel β-sheets,
one composed of β1 and β5, the other composed of β2, β3
and β4,

at least two disulfide bridges, one connecting α1 and β5
and one connecting β3 and the polypeptide segment
connecting β4 and β5, and

a loop region consisting of two polypeptide segments,
loop segment A (LSA) connecting β2 and β3 and comprising
typically 15-70 or, less typically, 5-14 amino acid
residues, and loop segment B (LSB) connecting β3 and β4
and comprising typically 5-12 or less typically, 2-4
amino acid residues.

3. A protein according to claim 1 wherein the model CTLD is
defined by showing sequence similarity to a previously
recognised member of the CTLD family as expressed by an
amino acid sequence identity of at least 22 %, preferably
at least 25 % and more preferably at least 30 %, and by
containing the cysteine residues necessary for establishing
the conserved two-disulfide bridge topology (i.e. CysI,
CysII, CysIII and CysIV), whereas the loop region and its
flanking β-strand structural elements are identified by
inspection of the sequence alignment with the collection of
CTLDs shown in Fig. 1 providing identification of the
sequence locations of the β2- and β3-strands with the
further corroboration provided by comparison of these
sequences with the four-residue consensus sequences, β2cseq
and β3cseq, the β4 strand segment being located typically
at positions -6 to -2 and less typically at positions -5 to
-2 relative to the conserved CysIII residue and with the
characteristic residues at positions -5 and -3 as
elucidated from Table 1 and deducted in the description.

4. A protein according to any one of the preceding claims
wherein up to 10, preferably up to 4, and more preferably 1
or 2, amino acid residues are substituted, deleted or
inserted in the α-helices and/or β-strands and/or
connecting segments of the model CTLD.

5. A protein according to any one of the preceding claims
wherein changes of up to 4 residues are made in the
β-strands of the model CTLD as a consequence of the
introduction of recognition sites for one or more
restriction endonucleases in the nucleotide sequence
encoding the CTLD to facilitate the excision of part or all
of the loop region and the insertion of an altered amino
acid sequence instead while the scaffold structure of the
CTLD is substantially maintained.

6. A protein according to any one of the preceding claims
wherein the model CTLD is that of a tetranectin.

7. A protein according to claim 6 wherein the model CTLD is
that of human tetranectin comprising amino acid residues 59
to 180 in the monomer native peptide sequence thereof.

8. A protein according to claim 6 wherein the model CTLD is
that of murine tetranectin comprising amino acid residues
59 to 180 in the monomer native peptide sequence thereof.

9. A protein according to any one of the preceding claims 
which further comprises N-terminal and/or C-terminal exten- 
sions of the CTLD variant. 

10. A protein according to claim 9 wherein said N-terminal
and/or C-terminal extensions contain effector, enzyme, fur- 
ther binding and/or multimerising functions. 

11. A protein according to claim 9 or 10 wherein said N- 
terminal and/or C-terminal extensions are the non-CTLD- 
portions of a native C-type lectin-like protein or C-type 
lectin or a "soluble" variant thereof lacking a functional 
transmembrane domain. 

12. A protein according to any one of the preceding claims
which is a multimer of a moiety comprising the CTLD
variant.

13. A protein according to claim 12 which is derived from 
the native tetranectin trimer. 

14. A protein according to claim 7 which is derived from
the polypeptide htlec having the amino acid sequence from
position 5 Glu to position 185 Val in SEQ ID NO: 13 by
altering the amino acid sequence of the loop region.

15. A protein according to claim 7 which is derived from
the polypeptide htCTLD having the amino acid sequence from
position 5 Ala to position 141 Val in SEQ ID NO: 15 by
altering the amino acid sequence of the loop region.

16. A protein according to claim 7 which is derived from
the polypeptide hTN having the amino acid sequence from
position 5 Glu to position 185 Val in SEQ ID NO: 09 by
altering the amino acid sequence of the loop region.

17. A protein according to claim 7 which is derived from
the polypeptide hTN3 having the amino acid sequence from
position 5 Ala to position 141 Val in SEQ ID NO: 11 by
altering the amino acid sequence of the loop region.

18. A protein according to claim 8 which is derived from
the polypeptide mtlec having the amino acid sequence from
position 5 Glu to position 185 Val in SEQ ID NO: 36 by
altering the amino acid sequence of the loop region.

19. A protein according to claim 8 which is derived from
the polypeptide mtCTLD having the amino acid sequence from
position 5 Ala to position 141 Val in SEQ ID NO: 38 by
altering the amino acid sequence of the loop region.

20. A combinatorial library of proteins having the scaffold
structure of C-type lectin-like domains (CTLD), said
proteins comprising variants of a model CTLD wherein the
α-helices and β-strands are conserved to such a degree that
the scaffold structure of the CTLD is substantially
maintained, while the loop region or parts of the loop
region of the CTLD is randomised with respect to amino acid
sequence and/or number of amino acid residues.

21. A combinatorial library according to claim 20 wherein
the model CTLD is defined by having a 3D structure that
conforms to the secondary-structure arrangement illustrated
in Fig. 1 characterised by the following main secondary
structure elements:

five β-strands and two α-helices sequentially appearing
in the order β1, α1, α2, β2, β3, β4, and β5, the
β-strands being arranged in two anti-parallel β-sheets,
one composed of β1 and β5, the other composed of β2, β3
and β4,

at least two disulfide bridges, one connecting α1 and β5
and one connecting β3 and the polypeptide segment
connecting β4 and β5, and

a loop region consisting of two polypeptide segments,
loop segment A (LSA) connecting β2 and β3 and comprising
typically 15-70 or, less typically, 5-14 amino acid
residues, and loop segment B (LSB) connecting β3 and β4
and comprising typically 5-12 or less typically, 2-4
amino acid residues.

22. A combinatorial library according to claim 20 wherein
the model CTLD is defined by showing sequence similarity to
a previously recognised member of the CTLD family as
expressed by an amino acid sequence identity of at least
22 %, preferably at least 25 % and more preferably at least
30 %, and by containing the cysteine residues necessary for
establishing the conserved two-disulfide bridge topology
(i.e. CysI, CysII, CysIII and CysIV), whereas the loop
region and its flanking β-strand structural elements are
identified by inspection of the sequence alignment with the
collection of CTLDs shown in Fig. 1 providing
identification of the sequence locations of the β2- and
β3-strands with the further corroboration provided by
comparison of these sequences with the four-residue
consensus sequences, β2cseq and β3cseq, the β4 strand
segment being located typically at positions -6 to -2 and
less typically at positions -5 to -2 relative to the
conserved CysIII residue and with the characteristic
residues at positions -5 and -3 as elucidated from Table 1
and deducted in the description.

23. A combinatorial library according to any one of claims
[...] 10, preferably up to 4, and more preferably 1 or 2,
amino acid residues are substituted, deleted or inserted in
the α-helices and β-strands and connecting segments of the
model CTLD.

24. A combinatorial library according to any one of claims
20-23 of proteins comprising CTLD variants wherein changes
of up to 4 residues are made in the model CTLD as a
consequence of the introduction of recognition sites for
one or more restriction endonucleases in the nucleotide
sequence encoding the CTLD to facilitate the excision of
part or all of a DNA segment encoding the loop region and
the insertion of members of an ensemble of DNA fragments
that collectively encode a randomised amino acid sequence
instead while the scaffold structure of the CTLD is
substantially maintained.

25. A combinatorial library according to any one of claims
20-24 of proteins wherein the model CTLD is that of a
tetranectin.

26. A combinatorial library according to claim 25 of
proteins wherein the model CTLD is that of human
tetranectin comprising amino acid residues 59 to 180 in the
monomer native peptide sequence thereof.

27. A combinatorial library according to claim 25 of
proteins wherein the model CTLD is that of murine
tetranectin comprising amino acid residues 59 to 180 in the
monomer native peptide sequence thereof.

28. A combinatorial library according to any one of claims
20-27 of proteins which further comprise N-terminal and/or
C-terminal extensions of the CTLD variant.

29. A combinatorial library according to claim 28 of
proteins wherein said N-terminal and/or C-terminal
extensions contain effector, enzyme, further binding and/or
multimerising functions.

30. A combinatorial library according to claim 28 or 29 of
proteins wherein said N-terminal and/or C-terminal
extensions are the non-CTLD-portions of a native C-type
lectin-like protein or C-type lectin or a "soluble" variant
thereof lacking a functional transmembrane domain.

31. A combinatorial library according to any one of claims
20-30 of proteins which are multimers of a moiety
comprising the CTLD variant.

32. A combinatorial library according to claim 31 of pro- 
teins which are derived from the native tetranectin trimer. 

33. A combinatorial library according to claim 27 of
proteins which are derived from the peptide htlec having
the amino acid sequence from position 5 Glu to position 185
Val in SEQ ID NO: 13 by randomising the loop region with
respect to amino acid sequence and/or number of amino acid
residues.

34. A combinatorial library according to claim 27 of
proteins which are derived from the peptide htCTLD having
the amino acid sequence from position 5 Ala to position 141
Val in SEQ ID NO: 15 by randomising the loop region with
respect to amino acid sequence and/or number of amino acid
residues.

35. A combinatorial library according to claim 27 of
proteins which are derived from the peptide hTN having the
amino acid sequence from position 5 Glu to position 185 Val
in SEQ ID NO: 09 by randomising the loop region with
respect to amino acid sequence and/or number of amino acid
residues.

36. A combinatorial library according to claim 27 of
proteins which are derived from the peptide hTN3 having the
amino acid sequence from position 5 Ala to position 141 Val
in SEQ ID NO: 11 by randomising the loop region with
respect to amino acid sequence and/or number of amino acid
residues.

37. A combinatorial library according to claim 28 of
proteins which are derived from the peptide mtlec having
the amino acid sequence from position 5 Glu to position 185
Val in SEQ ID NO: 36 by randomising the loop region with
respect to amino acid sequence and/or number of amino acid
residues.

38. A combinatorial library according to claim 28 of
proteins which are derived from the peptide mtCTLD having
the amino acid sequence from position 5 Ala to position 141
Val in SEQ ID NO: 38 by randomising the loop region with
respect to amino acid sequence and/or number of amino acid
residues.

39. A derivative of a native tetranectin wherein up to 10,
preferably up to 4, and more preferably 1 or 2, amino acid
residues are substituted, deleted or inserted in the
α-helices and/or β-strands and/or connecting segments of
its CTLD with the proviso that said derivative is not any
of the known CTLD derivatives of human tetranectin (hTN)
listed in Table 2 in the description.

40. A derivative of human tetranectin, termed htlec, having
the amino acid sequence from position 5 Glu to position 185
Val in SEQ ID NO: 13.

41. A derivative of human tetranectin, termed htCTLD,
having the amino acid sequence from position 5 Ala to
position 141 Val in SEQ ID NO: 15.

42. A derivative of human tetranectin, termed PhTN, having
the amino acid sequence given in SEQ ID NO: 09.

43. A derivative of human tetranectin, termed PhTN3, having
the amino acid sequence given in SEQ ID NO: 11.

44. A derivative of human tetranectin, termed Phtlec,
having the amino acid sequence given in SEQ ID NO: 13.

45. A derivative of human tetranectin, termed PhtCTLD,
having the amino acid sequence given in SEQ ID NO: 15.

46. A derivative of human tetranectin, termed FX-htlec, 
10 having the amino acid sequence given in SEQ IN NO: 02. 

47. A derivative of human tetranectin, termed FX-htCTLD, 
having the amino acid sequence given in SEQ IN NO: 04. 

48. A derivative of murine tetranectin, termed mtlec, hav- 
ing the amino acid sequence from position 5 Glu to position 

15 185 Val in SEQ IN NO:36. 

49. A derivative of murine tetranectin, termed mtCTLD, hav- 
ing the amino acid sequence from position 5 Ala to position 
141 Val in SEQ IN NO: 38. 

50. A derivative of murine tetranectin, termed Pmtlec, hav- 
20 ing the amino acid sequence given in SEQ IN NO: 36. 

51 . A derivative of murine tetranectin, termed PmtCTLD, 
having the amino acid sequence given in SEQ IN NO: 38. 

52. A derivative of murine tetranectin, termed FX -mtlec, 
having the amino acid sequence given in SEQ IN NO: 29. 

25 53. A derivative of murine tetranectin, termed FX-mtCTLD, 
having the amino acid sequence given in SEQ IN NO: 31. 
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54. Nucleic acid comprising a nucleotide sequence encoding 
a htlec insert as shown from nucleotide 20 to nucleotide 
562 in SEQ ID NO: 12. 

55. Nucleic acid comprising a nucleotide sequence encoding 
5 a htCTLD insert as shown from nucleotide 20 to nucleotide 

430 in SEQ ID NO: 14. 

56. Nucleic acid comprising a nucleotide sequence encoding 
a mtlec insert as shown from nucleotide 20 to nucleotide 
562 in SEQ ID NO: 35. 

10 57. Nucleic acid comprising a nucleotide sequence encoding 
a mtCTLD insert as shown from nucleotide 20 to nucleotide 
430 in SEQ ID NO: 37. 
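
The nucleotide ranges recited for the human inserts can be checked against the residue ranges recited for htlec and htCTLD. The following is a minimal sketch, assuming only that the coding sequence starts at nucleotide 8, as the <222> entries for SEQ ID NO: 12 and SEQ ID NO: 14 in the sequence listing indicate. The function name is illustrative and not part of the specification.

```python
def residue_to_nucleotide_range(cds_start, first_residue, last_residue):
    """Map a 1-based residue range onto 1-based, inclusive nucleotide positions.

    cds_start is the position of the first nucleotide of codon 1.
    """
    start = cds_start + 3 * (first_residue - 1)
    end = cds_start + 3 * last_residue - 1
    return start, end


# htlec: residues 5 (Glu) to 185 (Val), coding sequence starting at nucleotide 8
htlec_range = residue_to_nucleotide_range(8, 5, 185)

# htCTLD: residues 5 (Ala) to 141 (Val), coding sequence starting at nucleotide 8
htctld_range = residue_to_nucleotide_range(8, 5, 141)

print(htlec_range, htctld_range)
```

Under that assumption the two residue ranges map onto nucleotides 20 to 562 and 20 to 430 respectively, in agreement with the ranges recited above.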

58. Nucleic acid comprising any nucleotide sequence encoding
a protein according to any one of claims 1-19.

59. A library of nucleic acids encoding proteins of a
combinatorial library according to any one of claims 20-38, in
which the members of the ensemble of nucleic acids, that
collectively constitute said library of nucleic acids, are
able to be expressed in a display system, which provides
for a logical, physical or chemical link between entities
displaying phenotypes representing properties of the
displayed expression products and their corresponding
genotypes.

60. A library of nucleic acids according to claim 59,
wherein the display system is selected from
(I) a phage display system such as
    (1) a filamentous phage fd in which the library
        of nucleic acids is inserted into
        (a) a phagemid vector,
        (b) the viral genome of a phage,
        (c) purified viral nucleic acid in purified
            single- or double-stranded form, or
    (2) a phage lambda in which the library is
        inserted into
        (a) purified phage lambda DNA, or
        (b) the nucleic acid in lambda phage
            particles; or
(II) a viral display system in which the library of
    nucleic acids is inserted into the viral nucleic
    acid of a eukaryotic virus such as baculovirus;
    or
(III) a cell-based display system in which the library
    of nucleic acids is inserted into, or adjoined
    to, a nucleic acid carrier able to integrate
    either into the host genome or into an
    extrachromosomal element able to maintain and express
    itself within the cell and suitable for cell-surface
    display on the surface of
    (a) bacterial cells,
    (b) yeast cells, or
    (c) mammalian cells; or
(IV) a nucleic acid entity suitable for ribosome
    linked display into which the library of nucleic
    acid is inserted; or
(V) a plasmid suitable for plasmid linked display
    into which the library of nucleic acid is
    inserted.

61. A library of nucleic acids according to claim 60
wherein said phagemid vector is the vector "pCANTAB 5 E"
supplied by Amersham Pharmacia Biotech (code no. 27-9401-01)
for use with their "Recombinant Phage Antibody System".

62. A method of preparing a protein according to any one of
claims 1-19, wherein the protein comprises at least one or
more, identical or not identical, CTLD domains with novel
loop-region sequences which has (have) been isolated from
one or more CTLD libraries by screening or selection.

63. A method of preparing a protein according to claim 62,
wherein at least one CTLD domain has been further modified
by mutagenesis.

64. A method of preparing a protein according to claim 62
or 63, wherein the protein containing at least one CTLD
domain is assembled from two or more components by chemical
or enzymatic coupling or crosslinking.

65. A method of preparing a combinatorial library according
to any one of claims 20-38 comprising the following steps:

1) inserting nucleic acid encoding a protein comprising a
   model CTLD into a suitable vector,

2) if necessary, introducing restriction endonuclease
   recognition sites by site directed mutagenesis, said
   recognition sites being properly located in the
   sequence at or close to the ends of the sequence
   encoding the loop region of the CTLD or part thereof,

3) excising the DNA fragment encoding the loop region or
   part thereof by use of the proper restriction
   endonucleases,

4) ligating mixtures of DNA fragments into the restricted
   vector, and

5) inducing the vector to express randomised proteins
   having the scaffold structure of CTLDs in a suitable
   medium.
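
Steps 3) and 4) above can be pictured in silico. The following is a minimal sketch under stated assumptions: the loop-encoding fragments are randomised with NNK degenerate codons (a commonly used scheme, chosen here for illustration and not prescribed by the claim), and the two flanking strings are hypothetical placeholders standing in for the restricted vector ends.

```python
import random


def random_nnk_codon(rng):
    """One NNK codon: any base at positions 1 and 2, G or T at position 3."""
    return rng.choice("ACGT") + rng.choice("ACGT") + rng.choice("GT")


def random_loop_fragment(n_codons, rng):
    """A randomised DNA fragment encoding n_codons loop residues."""
    return "".join(random_nnk_codon(rng) for _ in range(n_codons))


def ligate(upstream, fragment, downstream):
    """Join a fragment between the two ends of the restricted vector."""
    return upstream + fragment + downstream


def build_library(upstream, downstream, n_codons, size, seed=0):
    """Return `size` vector inserts, each carrying a different random loop."""
    rng = random.Random(seed)
    return [
        ligate(upstream, random_loop_fragment(n_codons, rng), downstream)
        for _ in range(size)
    ]


# Hypothetical flanks; real flanks depend on the restriction sites chosen.
UPSTREAM = "GGTACC"
DOWNSTREAM = "CAATTG"
library = build_library(UPSTREAM, DOWNSTREAM, n_codons=6, size=5)
for member in library:
    print(member)
```

Varying `n_codons` between library members would model randomisation of loop length as well as loop sequence.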

66. A method of constructing a tetranectin derivative
adapted for the preparation of a combinatorial library
according to any one of claims 20-38, wherein the nucleic
acid encoding the tetranectin derivative has been modified
to generate endonuclease restriction sites within nucleic
acid segments encoding β2, β3 or β4, or up to 30
nucleotides upstream or downstream in the sequence from any
nucleotide which belongs to a nucleic acid segment encoding
β2, β3 or β4.

67. The use of a nucleotide sequence encoding a
tetranectin, or a derivative thereof wherein the scaffold structure
of its CTLD is substantially maintained, for preparing a
library of nucleotide sequences encoding related proteins
by randomising part or all of the nucleic acid sequence
encoding the loop region of its CTLD.

68. The use according to claim 67 wherein the nucleotide
sequence encodes a mammalian tetranectin.

69. The use according to claim 67 wherein the nucleotide 
sequence encodes human or murine tetranectin. 

70. The use according to claim 67 wherein the nucleotide
sequence encodes a derivative of a native tetranectin
wherein up to 10, preferably up to 4, and more preferably 1
or 2, amino acid residues are substituted, deleted or
inserted in the α-helices and β-strands and connecting
segments of its CTLD.

71. The use according to claim 67 wherein the nucleotide
sequence encodes a derivative of human tetranectin, termed
htlec, as shown from nucleotide 20 to nucleotide 562 in SEQ
ID NO: 12.

72. The use according to claim 67 wherein the nucleotide
sequence encodes a derivative of human tetranectin, termed
htCTLD, as shown from nucleotide 20 to nucleotide 430 in
SEQ ID NO: 14.

73. The use according to claim 67 wherein the nucleotide
sequence encodes a derivative of murine tetranectin, termed
mtlec, as shown from nucleotide 20 to nucleotide 562 in SEQ
ID NO: 35.

74. The use according to claim 67 wherein the nucleotide
sequence encodes a derivative of murine tetranectin, termed
mtCTLD, as shown from nucleotide 20 to nucleotide 430 in
SEQ ID NO: 37.

75. A method of screening a combinatorial library according
to any one of claims 20-38 for binding to a specific target
which comprises the following steps:

1) expressing a nucleic acids library according to any
   one of claims 59-61 to display the library of proteins
   in the display system;

2) contacting the collection of entities displayed with a
   suitably tagged target substance for which isolation
   of a CTLD-derived protein exhibiting affinity for said
   target substance is desired;

3) harvesting subpopulations of the entities displayed
   that exhibit affinity for said target substance by
   means of affinity-based selective extractions,
   utilizing the tag to which said target substance is
   conjugated or physically attached or adhering to as a
   vehicle or means of affinity purification, a procedure
   commonly referred to in the field as "affinity
   panning", followed by re-amplification of the
   sub-library;

4) isolating progressively better binders by repeated
   rounds of panning and re-amplification until a
   suitably small number of good candidate binders is
   obtained; and,

5) if desired, isolating each of the good candidates as
   an individual clone and subjecting it to ordinary
   functional and structural characterisation in
   preparation for final selection of one or more preferred
   product clones.
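
The effect of repeated rounds of panning and re-amplification in steps 3) and 4) can be illustrated numerically. The following is a toy model, not a description of any experiment in the specification: each round retains a fixed fraction of every clone on the tagged target, and re-amplification is modelled as renormalising the captured population. The clone names, starting frequencies and retention fractions are all illustrative assumptions.

```python
def pan_round(frequencies, retention):
    """One round: capture each clone with its retention fraction, then re-amplify."""
    captured = {clone: frequencies[clone] * retention[clone] for clone in frequencies}
    total = sum(captured.values())
    # Re-amplification restores the population size; only the proportions matter.
    return {clone: amount / total for clone, amount in captured.items()}


def pan(frequencies, retention, rounds):
    """Apply `rounds` successive rounds of panning and re-amplification."""
    for _ in range(rounds):
        frequencies = pan_round(frequencies, retention)
    return frequencies


# Assumed values: one binder per million clones, 1000-fold better retention.
start = {"binder": 1e-6, "background": 1.0 - 1e-6}
retention = {"binder": 0.5, "background": 0.0005}

after_one = pan(start, retention, rounds=1)
after_three = pan(start, retention, rounds=3)
print(after_one["binder"], after_three["binder"])
```

With these assumed numbers the binder is still rare after one round but dominates the sub-library after three, which is why the claim recites repeated rounds until a suitably small number of candidates remains.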

76. A method of reformatting a protein according to any one
of claims 1-19 or selected from a combinatorial library
according to any one of claims 20-38 and containing a CTLD
variant exhibiting desired binding properties, in a desired
alternative species-compatible framework by excising the
nucleic acid fragment encoding the loop region-substituting
polypeptide and any required single framework mutations
from the nucleic acid encoding said protein using PCR
technology, site directed mutagenesis or restriction enzyme
digestion and inserting said nucleic acid fragment into the
appropriate location(s) in a display- or protein expression
vector that harbours a nucleic acid sequence encoding the
desired alternative CTLD framework.

Figure 26
(Panel A) Phtlec PhTN3 Help.Ph

Figure 28

Figure 29

Figure 34
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SEQUENCE LISTING 

<110> Borean Pharma A/S 

<120> Combinatorial libraries of proteins having the scaffold 
structure of C-type lectin-like domains 

<130> P200001100 WO JNy 

<140> 
<141> 

<150> DK PA 2000 01872 
<151> 2000-12-13 

<160> 91 

<170> PatentIn Ver. 2.1

<210> 1
<211> 571
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (1)..(564)
<223> FX-htlec encoding insert

<400> 1
gga tcc atc gag ggt agg ggc gag cca cca acc cag aag ccc aag aag 48
Gly Ser Ile Glu Gly Arg Gly Glu Pro Pro Thr Gln Lys Pro Lys Lys
1 5 10 15

att gta aat gcc aag aaa gat gtt gtg aac aca aag atg ttt gag gag 96
Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu
20 25 30

ctc aag agc cgt ctg gac acc ctg gcc cag gag gtg gcc ctg ctg aag 144
Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys
35 40 45

gag cag cag gcc ctg cag acg gtc gtc ctg aag ggg acc aag gtg cac 192
Glu Gln Gln Ala Leu Gln Thr Val Val Leu Lys Gly Thr Lys Val His
50 55 60

atg aaa gtc ttt ctg gcc ttc acc cag acg aag acc ttc cac gag gcc 240
Met Lys Val Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala
65 70 75 80

agc gag gac tgc atc tcg cgc ggg ggc acc ctg agc acc cct cag act 288
Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr
85 90 95

ggc tcg gag aac gac gcc ctg tat gag tac ctg cgc cag agc gtg ggc 336
Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly
100 105 110

aac gag gcc gag atc tgg ctg ggc ctc aac gac atg gcg gcc gag ggc 384
Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly
115 120 125

acc tgg gtg gac atg acc ggt acc cgc atc gcc tac aag aac tgg gag 432
Thr Trp Val Asp Met Thr Gly Thr Arg Ile Ala Tyr Lys Asn Trp Glu
130 135 140

act gag atc acc gcg caa ccc gat ggc ggc aag acc gag aac tgc gcg 480
Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala
145 150 155 160

gtc ctg tca ggc gcg gcc aac ggc aag tgg ttc gac aag cgc tgc cgc 528
Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg
165 170 175

gat caa ttg ccc tac atc tgc cag ttc ggg atc gtg taagctt 571
Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val
180 185



<210> 2
<211> 188
<212> PRT
<213> Homo sapiens

<400> 2
Gly Ser Ile Glu Gly Arg Gly Glu Pro Pro Thr Gln Lys Pro Lys Lys
1 5 10 15

Ile Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu
20 25 30

Leu Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys
35 40 45

Glu Gln Gln Ala Leu Gln Thr Val Val Leu Lys Gly Thr Lys Val His
50 55 60

Met Lys Val Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala
65 70 75 80

Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr
85 90 95

Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly
100 105 110

Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly
115 120 125

Thr Trp Val Asp Met Thr Gly Thr Arg Ile Ala Tyr Lys Asn Trp Glu
130 135 140

Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala
145 150 155 160

Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg
165 170 175

Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val
180 185

<210> 3
<211> 436
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (1)..(429)
<223> FX-htCTLD encoding insert

<400> 3
gga tcc atc gag ggt agg gcc ctg cag acg gtc gtc ctg aag ggg acc 48
Gly Ser Ile Glu Gly Arg Ala Leu Gln Thr Val Val Leu Lys Gly Thr
1 5 10 15

aag gtg cac atg aaa gtc ttt ctg gcc ttc acc cag acg aag acc ttc 96
Lys Val His Met Lys Val Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe
20 25 30

cac gag gcc agc gag gac tgc atc tcg cgc ggg ggc acc ctg agc acc 144
His Glu Ala Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr
35 40 45

cct cag act ggc tcg gag aac gac gcc ctg tat gag tac ctg cgc cag 192
Pro Gln Thr Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln
50 55 60

agc gtg ggc aac gag gcc gag atc tgg ctg ggc ctc aac gac atg gcg 240
Ser Val Gly Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala
65 70 75 80

gcc gag ggc acc tgg gtg gac atg acc ggt acc cgc atc gcc tac aag 288
Ala Glu Gly Thr Trp Val Asp Met Thr Gly Thr Arg Ile Ala Tyr Lys
85 90 95

aac tgg gag act gag atc acc gcg caa ccc gat ggc ggc aag acc gag 336
Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu
100 105 110

aac tgc gcg gtc ctg tca ggc gcg gcc aac ggc aag tgg ttc gac aag 384
Asn Cys Ala Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys
115 120 125

cgc tgc cgc gat caa ttg ccc tac atc tgc cag ttc ggg atc gtg 429
Arg Cys Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val
130 135 140

taagctt 436
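
The pairing of codon rows and amino acid rows in a CDS entry can be verified by translating the DNA with the standard genetic code. The following is a minimal sketch: the two DNA strings are the first and last codon rows of the FX-htCTLD insert above, read with the scanner's "e" for "c" substitutions corrected, and the function name is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate DNA (spaces ignored) into one-letter amino acids up to the first stop."""
    dna = dna.replace(" ", "").lower()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "gga tcc atc gag ggt agg gcc ctg cag acg gtc gtc ctg aag ggg acc"
last_row = "cgc tgc cgc gat caa ttg ccc tac atc tgc cag ttc ggg atc gtg taagctt"

print(translate(first_row))
print(translate(last_row))
```

The first row translates to Gly Ser Ile Glu Gly Arg Ala Leu Gln Thr Val Val Leu Lys Gly Thr in one-letter form, and translation of the last row stops at the taa codon that follows the final Val.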



<210> 4
<211> 143
<212> PRT
<213> Homo sapiens

<400> 4
Gly Ser Ile Glu Gly Arg Ala Leu Gln Thr Val Val Leu Lys Gly Thr
1 5 10 15

Lys Val His Met Lys Val Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe
20 25 30

His Glu Ala Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr
35 40 45

Pro Gln Thr Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln
50 55 60

Ser Val Gly Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala
65 70 75 80

Ala Glu Gly Thr Trp Val Asp Met Thr Gly Thr Arg Ile Ala Tyr Lys
85 90 95

Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu
100 105 110

Asn Cys Ala Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys
115 120 125

Arg Cys Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val
130 135 140



<210> 5 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 5 

cggctgagcg gcccagccgg ccatggccga gccaccaacc cagaagc 

<210> 6 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 6 

cctgcggccg ccacgatccc gaactgg 



<210> 7 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 7 

cggctgagcg gcccagccgg ccatggccgc cctgcagacg gtc



WO 02/48189 



PCT/DK01/00825 



<210> 8
<211> 570
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (8)..(565)
<223> PhTN encoding insert

<400> 8
ggcccag ccg gcc atg gcc gag cca cca acc cag aag ccc aag aag att 49
Pro Ala Met Ala Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile
1 5 10

gta aat gcc aag aaa gat gtt gtg aac aca aag atg ttt gag gag ctc 97
Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu
15 20 25 30

aag agc cgt ctg gac acc ctg gcc cag gag gtg gcc ctg ctg aag gag 145
Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu
35 40 45

cag cag gcc ctg cag acg gtc tgc ctg aag ggg acc aag gtg cac atg 193
Gln Gln Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met
50 55 60

aaa tgc ttt ctg gcc ttc acc cag acg aag acc ttc cac gag gcc agc 241
Lys Cys Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser
65 70 75

gag gac tgc atc tcg cgc ggg ggc acc ctg agc acc cct cag act ggc 289
Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly
80 85 90

tcg gag aac gac gcc ctg tat gag tac ctg cgc cag agc gtg ggc aac 337
Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn
95 100 105 110

gag gcc gag atc tgg ctg ggc ctc aac gac atg gcg gcc gag ggc acc 385
Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Thr
115 120 125

tgg gtg gac atg acc ggc gcc cgc atc gcc tac aag aac tgg gag act 433
Trp Val Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr
130 135 140

gag atc acc gcg caa ccc gat ggc ggc aag acc gag aac tgc gcg gtc 481
Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala Val
145 150 155

ctg tca ggc gcg gcc aac ggc aag tgg ttc gac aag cgc tgc cgc gat 529
Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp
160 165 170

cag ctg ccc tac atc tgc cag ttc ggg atc gtg gcg gccgc 570
Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val Ala
175 180 185

<210> 9
<211> 186
<212> PRT
<213> Homo sapiens

<400> 9
Pro Ala Met Ala Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn
1 5 10 15

Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser
20 25 30

Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln
35 40 45

Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val His Met Lys Cys
50 55 60

Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp
65 70 75 80

Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu
85 90 95

Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala
100 105 110

Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp Val
115 120 125

Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile
130 135 140

Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser
145 150 155 160

Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln Leu
165 170 175

Pro Tyr Ile Cys Gln Phe Gly Ile Val Ala
180 185



<210> 10
<211> 438
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (8)..(433)
<223> PhTN3 encoding insert

<400> 10
ggcccag ccg gcc atg gcc gcc ctg cag acg gtc tgc ctg aag ggg acc 49
Pro Ala Met Ala Ala Leu Gln Thr Val Cys Leu Lys Gly Thr
1 5 10

aag gtg cac atg aaa tgc ttt ctg gcc ttc acc cag acg aag acc ttc 97
Lys Val His Met Lys Cys Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe
15 20 25 30

cac gag gcc agc gag gac tgc atc tcg cgc ggg ggc acc ctg agc acc 145
His Glu Ala Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr
35 40 45

cct cag act ggc tcg gag aac gac gcc ctg tat gag tac ctg cgc cag 193
Pro Gln Thr Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln
50 55 60

agc gtg ggc aac gag gcc gag atc tgg ctg ggc ctc aac gac atg gcg 241
Ser Val Gly Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala
65 70 75

gcc gag ggc acc tgg gtg gac atg acc ggc gcc cgc atc gcc tac aag 289
Ala Glu Gly Thr Trp Val Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys
80 85 90

aac tgg gag act gag atc acc gcg caa ccc gat ggc ggc aag acc gag 337
Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu
95 100 105 110

aac tgc gcg gtc ctg tca ggc gcg gcc aac ggc aag tgg ttc gac aag 385
Asn Cys Ala Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys
115 120 125

cgc tgc cgc gat cag ctg ccc tac atc tgc cag ttc ggg atc gtg gcg 433
Arg Cys Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val Ala
130 135 140

gccgc 438



<210> 11
<211> 142
<212> PRT
<213> Homo sapiens

<400> 11
Pro Ala Met Ala Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val
1 5 10 15

His Met Lys Cys Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu
20 25 30

Ala Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln
35 40 45

Thr Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val
50 55 60

Gly Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu
65 70 75 80

Gly Thr Trp Val Asp Met Thr Gly Ala Arg Ile Ala Tyr Lys Asn Trp
85 90 95

Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys
100 105 110

Ala Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys
115 120 125

Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val Ala
130 135 140



<210> 12
<211> 570
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (8)..(565)
<223> Phtlec encoding insert

<400> 12
ggcccag ccg gcc atg gcc gag cca cca acc cag aag ccc aag aag att 49
Pro Ala Met Ala Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile
1 5 10

gta aat gcc aag aaa gat gtt gtg aac aca aag atg ttt gag gag ctc 97
Val Asn Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu
15 20 25 30

aag agc cgt ctg gac acc ctg gcc cag gag gtg gcc ctg ctg aag gag 145
Lys Ser Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu
35 40 45

cag cag gcc ctg cag acg gtc gtc ctg aag ggg acc aag gtg cac atg 193
Gln Gln Ala Leu Gln Thr Val Val Leu Lys Gly Thr Lys Val His Met
50 55 60

aaa gtc ttt ctg gcc ttc acc cag acg aag acc ttc cac gag gcc agc 241
Lys Val Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser
65 70 75

gag gac tgc atc tcg cgc ggg ggc acc ctg agc acc cct cag act ggc 289
Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly
80 85 90

tcg gag aac gac gcc ctg tat gag tac ctg cgc cag agc gtg ggc aac 337
Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn
95 100 105 110

gag gcc gag atc tgg ctg ggc ctc aac gac atg gcg gcc gag ggc acc 385
Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Thr
115 120 125

tgg gtg gac atg acc ggt acc cgc atc gcc tac aag aac tgg gag act 433
Trp Val Asp Met Thr Gly Thr Arg Ile Ala Tyr Lys Asn Trp Glu Thr
130 135 140

gag atc acc gcg caa ccc gat ggc ggc aag acc gag aac tgc gcg gtc 481
Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala Val
145 150 155

ctg tca ggc gcg gcc aac ggc aag tgg ttc gac aag cgc tgc cgc gat 529
Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp
160 165 170

caa ttg ccc tac atc tgc cag ttc ggg atc gtg gcg gccgc 570
Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val Ala
175 180 185



<210> 13
<211> 186
<212> PRT
<213> Homo sapiens

<400> 13
Pro Ala Met Ala Glu Pro Pro Thr Gln Lys Pro Lys Lys Ile Val Asn
1 5 10 15

Ala Lys Lys Asp Val Val Asn Thr Lys Met Phe Glu Glu Leu Lys Ser
20 25 30

Arg Leu Asp Thr Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Gln Gln
35 40 45

Ala Leu Gln Thr Val Val Leu Lys Gly Thr Lys Val His Met Lys Val
50 55 60

Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu Ala Ser Glu Asp
65 70 75 80

Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln Thr Gly Ser Glu
85 90 95

Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val Gly Asn Glu Ala
100 105 110

Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Thr Trp Val
115 120 125

Asp Met Thr Gly Thr Arg Ile Ala Tyr Lys Asn Trp Glu Thr Glu Ile
130 135 140

Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys Ala Val Leu Ser
145 150 155 160

Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln Leu
165 170 175

Pro Tyr Ile Cys Gln Phe Gly Ile Val Ala
180 185



<210> 14
<211> 438
<212> DNA
<213> Homo sapiens

<220>
<221> CDS
<222> (8)..(433)
<223> PhtCTLD encoding insert

<400> 14
ggcccag ccg gcc atg gcc gcc ctg cag acg gtc gtc ctg aag ggg acc 49
Pro Ala Met Ala Ala Leu Gln Thr Val Val Leu Lys Gly Thr
1 5 10

aag gtg cac atg aaa gtc ttt ctg gcc ttc acc cag acg aag acc ttc 97
Lys Val His Met Lys Val Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe
15 20 25 30

cac gag gcc agc gag gac tgc atc tcg cgc ggg ggc acc ctg agc acc 145
His Glu Ala Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr
35 40 45

cct cag act ggc tcg gag aac gac gcc ctg tat gag tac ctg cgc cag 193
Pro Gln Thr Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln
50 55 60

agc gtg ggc aac gag gcc gag atc tgg ctg ggc ctc aac gac atg gcg 241
Ser Val Gly Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala
65 70 75

gcc gag ggc acc tgg gtg gac atg acc ggt acc cgc atc gcc tac aag 289
Ala Glu Gly Thr Trp Val Asp Met Thr Gly Thr Arg Ile Ala Tyr Lys
80 85 90

aac tgg gag act gag atc acc gcg caa ccc gat ggc ggc aag acc gag 337
Asn Trp Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu
95 100 105 110

aac tgc gcg gtc ctg tca ggc gcg gcc aac ggc aag tgg ttc gac aag 385
Asn Cys Ala Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys
115 120 125

cgc tgc cgc gat caa ttg ccc tac atc tgc cag ttc ggg atc gtg gcg 433
Arg Cys Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val Ala
130 135 140

gccgc 438



<210> 15
<211> 142
<212> PRT
<213> Homo sapiens

<400> 15
Pro Ala Met Ala Ala Leu Gln Thr Val Val Leu Lys Gly Thr Lys Val
1 5 10 15

His Met Lys Val Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu
20 25 30

Ala Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln
35 40 45

Thr Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val
50 55 60

Gly Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu
65 70 75 80

Gly Thr Trp Val Asp Met Thr Gly Thr Arg Ile Ala Tyr Lys Asn Trp
85 90 95

Glu Thr Glu Ile Thr Ala Gln Pro Asp Gly Gly Lys Thr Glu Asn Cys
100 105 110

Ala Val Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys
115 120 125

Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Gly Ile Val Ala
130 135 140



<210> 16 

<211> 555 

<212> DNA 

<213> Mus musculus 

<220> 

<223> EcoRI to HindIII insert containing mtlec encoding
part

<400> 16 

ggaattcgag tcacccactc ccaaggccaa gaaggctgca aatgccaaga aagatttggt 60 
gagctcaaag atgtcgagga gctcaagaac aggatggatg tcctggccca ggaggtggcc 120 
ctgctgaagg agaagcaggc cttacagact gtggtcctga agggcaccaa ggtgaacttg 180 
aaggtcctcc tggccttcac ccaaccgaag accttccatg aggcgagcga ggactgcatc 240 
tcgcaagggg gcacgctggg caccccgcag tcagagctag agaacgaggc gctgttcgag 300 
tacgcgcgcc acagcgtggg caacgatgcg gagatctggc tgggcctcaa cgacatggcc 360 
gcggaaggcg cctgggtgga catgaccggt accctcctgg cctacaagaa ctgggagacg 420 
gagatcacga cgcaacccga cggcggcaaa gccgagaact gcgccgccct gtctggcgca 480 
gccaacggca agtggttcga caagcgatgc cgcgatcaat tgccctacat ctgccagttt 540 
gccattgtga agctt 555 



<210> 17 
<211> 77 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 17 

cggaattcga gtcacccact cccaaggcca agaaggctgc aaatgccaag aaagatttgg 60 
tgagctcaaa gatgttc 77 



<210> 18 
<211> 94 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 18 

gcggatccag gcctgcttct ccttcagcag ggccacctcc tgggccagga catccatcct 60 
gttcttgagc tcctcgaaca tctttgagct cacc 94
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<210> 19 
<211> 97 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 19 

gcaggcctta cagactgtgt gcctgaaggg caccaaggtg aacttgaagt gcctcctggc 60 
cttcacccaa ccgaagacct tccatgaggc gagcgag 97 

<210> 20 
<211> 93 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 20 

ccgcatgctt cgaacagcgc ctcgttctct agctctgact gcggggtgcc cagcgtgccc 60 
ccttgcgaga tgcagtcctc gctcgcctca tgg 93

<210> 21 
<211> 61 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 21 

ggttcgaata cgcgcgccac agcgtgggca acgatgcgga gatctaaatg ctcccaattg 60 
c 61 

<210> 22 
<211> 55 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 22 

ccaagcttca caatggcaaa ctggcagatg tagggcaatt gggagcattt agatc 55

<210> 23 
<211> 86 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 23 

cggagatctg gctgggcctc aacgacatgg ccgcggaagg cgcctgggtg gacatgaccg 60 
gtaccctcct ggcctacaag aactgg 86 



<210> 24 
<211> 130 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 24 

gggcaattga tcgcggcatc gcttgtcgaa cctcttgccg ttggctgcgc cagacagggc 60 

ggcgcagttc tcggctttgc cgccgtcggg ttgcgtcgtg atctccgtct cccagttctt 120 
gtaggccagg 130 



<210> 25 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 25 

ctgggatcca tccagggtcg cgagtcaccc actcccaagg 40



<210> 26 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 26 

ccgaagctta cacaatggca aactggc 27 



<210> 27 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 27 

ctgggatcca tccagggtcg cgccttacag actgtggtc 39

<210> 28
<211> 568
<212> DNA
<213> Mus musculus

<220>
<221> CDS
<222> (1)..(561)
<223> FX-mtlec encoding insert

<400> 28
gga tcc atc cag ggt cgc gag tca ccc act ccc aag gcc aag aag gct 48
Gly Ser Ile Gln Gly Arg Glu Ser Pro Thr Pro Lys Ala Lys Lys Ala
1 5 10 15

gca aat gcc aag aaa gat ttg gtg agc tca aag atg ttc gag gag ctc 96
Ala Asn Ala Lys Lys Asp Leu Val Ser Ser Lys Met Phe Glu Glu Leu
20 25 30

aag aac agg atg gat gtc ctg gcc cag gag gtg gcc ctg ctg aag gag 144
Lys Asn Arg Met Asp Val Leu Ala Gln Glu Val Ala Leu Leu Lys Glu
35 40 45

aag cag gcc tta cag act gtg gtc ctg aag ggc acc aag gtg aac ttg 192
Lys Gln Ala Leu Gln Thr Val Val Leu Lys Gly Thr Lys Val Asn Leu
50 55 60

aag gtc ctc ctg gcc ttc acc caa ccg aag acc ttc cat gag gcg agc 240
Lys Val Leu Leu Ala Phe Thr Gln Pro Lys Thr Phe His Glu Ala Ser
65 70 75 80

gag gac tgc atc tcg caa ggg ggc acg ctg ggc acc ccg cag tca gag 288
Glu Asp Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr Pro Gln Ser Glu
85 90 95

cta gag aac gag gcg ctg ttc gag tac gcg cgc cac agc gtg ggc aac 336
Leu Glu Asn Glu Ala Leu Phe Glu Tyr Ala Arg His Ser Val Gly Asn
100 105 110

gat gcg gag atc tgg ctg ggc ctc aac gac atg gcc gcg gaa ggc gcc 384
Asp Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Ala
115 120 125

tgg gtg gac atg acc ggt acc ctc ctg gcc tac aag aac tgg gag acg 432
Trp Val Asp Met Thr Gly Thr Leu Leu Ala Tyr Lys Asn Trp Glu Thr
130 135 140

gag atc acg acg caa ccc gac ggc ggc aaa gcc gag aac tgc gcc gcc 480
Glu Ile Thr Thr Gln Pro Asp Gly Gly Lys Ala Glu Asn Cys Ala Ala
145 150 155 160

ctg tct ggc gca gcc aac ggc aag tgg ttc gac aag cga tgc cgc gat 528
Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp
165 170 175

caa ttg ccc tac atc tgc cag ttt gcc att gtg taagctt 568
Gln Leu Pro Tyr Ile Cys Gln Phe Ala Ile Val
180 185

<210> 29
<211> 187
<212> PRT
<213> Mus musculus

<400> 29
Gly Ser Ile Gln Gly Arg Glu Ser Pro Thr Pro Lys Ala Lys Lys Ala
1 5 10 15

Ala Asn Ala Lys Lys Asp Leu Val Ser Ser Lys Met Phe Glu Glu Leu
20 25 30

Lys Asn Arg Met Asp Val Leu Ala Gln Glu Val Ala Leu Leu Lys Glu
35 40 45

Lys Gln Ala Leu Gln Thr Val Val Leu Lys Gly Thr Lys Val Asn Leu
50 55 60

Lys Val Leu Leu Ala Phe Thr Gln Pro Lys Thr Phe His Glu Ala Ser
65 70 75 80

Glu Asp Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr Pro Gln Ser Glu
85 90 95

Leu Glu Asn Glu Ala Leu Phe Glu Tyr Ala Arg His Ser Val Gly Asn
100 105 110

Asp Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Ala
115 120 125

Trp Val Asp Met Thr Gly Thr Leu Leu Ala Tyr Lys Asn Trp Glu Thr
130 135 140

Glu Ile Thr Thr Gln Pro Asp Gly Gly Lys Ala Glu Asn Cys Ala Ala
145 150 155 160

Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp
165 170 175

Gln Leu Pro Tyr Ile Cys Gln Phe Ala Ile Val
180 185



<210> 30
<211> 436
<212> DNA
<213> Mus musculus
<220>
<221> CDS
<222> (1)..(429)
<223> FX-mtCTLD encoding insert
<400> 30

gga tcc atc cag ggt cgc gcc tta cag act gtg gtc ctg aag ggc acc 48
Gly Ser Ile Gln Gly Arg Ala Leu Gln Thr Val Val Leu Lys Gly Thr
1 5 10 15

aag gtg aac ttg aag gtc ctc ctg gcc ttc acc caa ccg aag acc ttc 96
Lys Val Asn Leu Lys Val Leu Leu Ala Phe Thr Gln Pro Lys Thr Phe
20 25 30

cat gag gcg agc gag gac tgc atc tcg caa ggg ggc acg ctg ggc acc 144
His Glu Ala Ser Glu Asp Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr
35 40 45

ccg cag tca gag cta gag aac gag gcg ctg ttc gag tac gcg cgc cac 192
Pro Gln Ser Glu Leu Glu Asn Glu Ala Leu Phe Glu Tyr Ala Arg His
50 55 60

agc gtg ggc aac gat gcg gag atc tgg ctg ggc ctc aac gac atg gcc 240
Ser Val Gly Asn Asp Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala
65 70 75 80

gcg gaa ggc gcc tgg gtg gac atg acc ggt acc ctc ctg gcc tac aag 288
Ala Glu Gly Ala Trp Val Asp Met Thr Gly Thr Leu Leu Ala Tyr Lys
85 90 95

aac tgg gag acg gag atc acg acg caa ccc gac ggc ggc aaa gcc gag 336
Asn Trp Glu Thr Glu Ile Thr Thr Gln Pro Asp Gly Gly Lys Ala Glu
100 105 110

aac tgc gcc gcc ctg tct ggc gca gcc aac ggc aag tgg ttc gac aag 384
Asn Cys Ala Ala Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys
115 120 125

cga tgc cgc gat caa ttg ccc tac atc tgc cag ttt gcc att gtg 429
Arg Cys Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Ala Ile Val
130 135 140

taagctt 436



<210> 31
<211> 143
<212> PRT
<213> Mus musculus
<400> 31

Gly Ser Ile Gln Gly Arg Ala Leu Gln Thr Val Val Leu Lys Gly Thr
1 5 10 15

Lys Val Asn Leu Lys Val Leu Leu Ala Phe Thr Gln Pro Lys Thr Phe
20 25 30

His Glu Ala Ser Glu Asp Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr
35 40 45

Pro Gln Ser Glu Leu Glu Asn Glu Ala Leu Phe Glu Tyr Ala Arg His
50 55 60

Ser Val Gly Asn Asp Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala
65 70 75 80

Ala Glu Gly Ala Trp Val Asp Met Thr Gly Thr Leu Leu Ala Tyr Lys
85 90 95

Asn Trp Glu Thr Glu Ile Thr Thr Gln Pro Asp Gly Gly Lys Ala Glu
100 105 110

Asn Cys Ala Ala Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys
115 120 125

Arg Cys Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Ala Ile Val
130 135 140



<210> 32 
<211> 47 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 32 

cggctgagcg gcccagccgg ccatggccga gtcacccact cccaagg 47 

<210> 33 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 33 

cctgcggccg ccacgatccc gaactgg 27 

<210> 34 
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 34 

cggctgagcg gcccagccgg ccatggccgc cttacagact gtggtc 46 

<210> 35
<211> 570
<212> DNA
<213> Mus musculus
<220>
<221> CDS
<222> (8)..(565)
<223> Pmtlec encoding insert
<400> 35

ggcccag ccg gcc atg gcc gag tca ccc act ccc aag gcc aag aag gct 49
Pro Ala Met Ala Glu Ser Pro Thr Pro Lys Ala Lys Lys Ala
1 5 10

gca aat gcc aag aaa gat ttg gtg agc tca aag atg ttc gag gag ctc 97
Ala Asn Ala Lys Lys Asp Leu Val Ser Ser Lys Met Phe Glu Glu Leu
15 20 25 30

aag aac agg atg gat gtc ctg gcc cag gag gtg gcc ctg ctg aag gag 145
Lys Asn Arg Met Asp Val Leu Ala Gln Glu Val Ala Leu Leu Lys Glu
35 40 45

aag cag gcc tta cag act gtg gtc ctg aag ggc acc aag gtg aac ttg 193
Lys Gln Ala Leu Gln Thr Val Val Leu Lys Gly Thr Lys Val Asn Leu
50 55 60

aag gtc ctc ctg gcc ttc acc caa ccg aag acc ttc cat gag gcg agc 241
Lys Val Leu Leu Ala Phe Thr Gln Pro Lys Thr Phe His Glu Ala Ser
65 70 75

gag gac tgc atc tcg caa ggg ggc acg ctg ggc acc ccg cag tca gag 289
Glu Asp Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr Pro Gln Ser Glu
80 85 90

cta gag aac gag gcg ctg ttc gag tac gcg cgc cac agc gtg ggc aac 337
Leu Glu Asn Glu Ala Leu Phe Glu Tyr Ala Arg His Ser Val Gly Asn
95 100 105 110

gat gcg gag atc tgg ctg ggc ctc aac gac atg gcc gcg gaa ggc gcc 385
Asp Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Ala
115 120 125

tgg gtg gac atg acc ggt acc ctc ctg gcc tac aag aac tgg gag acg 433
Trp Val Asp Met Thr Gly Thr Leu Leu Ala Tyr Lys Asn Trp Glu Thr
130 135 140

gag atc acg acg caa ccc gac ggc ggc aaa gcc gag aac tgc gcc gcc 481
Glu Ile Thr Thr Gln Pro Asp Gly Gly Lys Ala Glu Asn Cys Ala Ala
145 150 155

ctg tct ggc gca gcc aac ggc aag tgg ttc gac aag cga tgc cgc gat 529
Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp
160 165 170

caa ttg ccc tac atc tgc cag ttt gcc att gtg gcg gccgc 570
Gln Leu Pro Tyr Ile Cys Gln Phe Ala Ile Val Ala
175 180 185

<210> 36
<211> 186
<212> PRT
<213> Mus musculus
<400> 36

Pro Ala Met Ala Glu Ser Pro Thr Pro Lys Ala Lys Lys Ala Ala Asn
1 5 10 15

Ala Lys Lys Asp Leu Val Ser Ser Lys Met Phe Glu Glu Leu Lys Asn
20 25 30

Arg Met Asp Val Leu Ala Gln Glu Val Ala Leu Leu Lys Glu Lys Gln
35 40 45

Ala Leu Gln Thr Val Val Leu Lys Gly Thr Lys Val Asn Leu Lys Val
50 55 60

Leu Leu Ala Phe Thr Gln Pro Lys Thr Phe His Glu Ala Ser Glu Asp
65 70 75 80

Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr Pro Gln Ser Glu Leu Glu
85 90 95

Asn Glu Ala Leu Phe Glu Tyr Ala Arg His Ser Val Gly Asn Asp Ala
100 105 110

Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu Gly Ala Trp Val
115 120 125

Asp Met Thr Gly Thr Leu Leu Ala Tyr Lys Asn Trp Glu Thr Glu Ile
130 135 140

Thr Thr Gln Pro Asp Gly Gly Lys Ala Glu Asn Cys Ala Ala Leu Ser
145 150 155 160

Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys Arg Asp Gln Leu
165 170 175

Pro Tyr Ile Cys Gln Phe Ala Ile Val Ala
180 185



<210> 37
<211> 438
<212> DNA
<213> Mus musculus
<220>
<221> CDS
<222> (8)..(433)
<223> PmtCTLD encoding insert
<400> 37

ggcccag ccg gcc atg gcc gcc tta cag act gtg gtc ctg aag ggc acc 49
Pro Ala Met Ala Ala Leu Gln Thr Val Val Leu Lys Gly Thr
1 5 10

aag gtg aac ttg aag gtc ctc ctg gcc ttc acc caa ccg aag acc ttc 97
Lys Val Asn Leu Lys Val Leu Leu Ala Phe Thr Gln Pro Lys Thr Phe
15 20 25 30

cat gag gcg agc gag gac tgc atc tcg caa ggg ggc acg ctg ggc acc 145
His Glu Ala Ser Glu Asp Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr
35 40 45

ccg cag tca gag cta gag aac gag gcg ctg ttc gag tac gcg cgc cac 193
Pro Gln Ser Glu Leu Glu Asn Glu Ala Leu Phe Glu Tyr Ala Arg His
50 55 60

agc gtg ggc aac gat gcg gag atc tgg ctg ggc ctc aac gac atg gcc 241
Ser Val Gly Asn Asp Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala
65 70 75

gcg gaa ggc gcc tgg gtg gac atg acc ggt acc ctc ctg gcc tac aag 289
Ala Glu Gly Ala Trp Val Asp Met Thr Gly Thr Leu Leu Ala Tyr Lys
80 85 90

aac tgg gag acg gag atc acg acg caa ccc gac ggc ggc aaa gcc gag 337
Asn Trp Glu Thr Glu Ile Thr Thr Gln Pro Asp Gly Gly Lys Ala Glu
95 100 105 110

aac tgc gcc gcc ctg tct ggc gca gcc aac ggc aag tgg ttc gac aag 385
Asn Cys Ala Ala Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys
115 120 125

cga tgc cgc gat caa ttg ccc tac atc tgc cag ttt gcc att gtg gcg 433
Arg Cys Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Ala Ile Val Ala
130 135 140

gccgc 438



<210> 38
<211> 142
<212> PRT
<213> Mus musculus
<400> 38

Pro Ala Met Ala Ala Leu Gln Thr Val Val Leu Lys Gly Thr Lys Val
1 5 10 15

Asn Leu Lys Val Leu Leu Ala Phe Thr Gln Pro Lys Thr Phe His Glu
20 25 30

Ala Ser Glu Asp Cys Ile Ser Gln Gly Gly Thr Leu Gly Thr Pro Gln
35 40 45

Ser Glu Leu Glu Asn Glu Ala Leu Phe Glu Tyr Ala Arg His Ser Val
50 55 60

Gly Asn Asp Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu
65 70 75 80

Gly Ala Trp Val Asp Met Thr Gly Thr Leu Leu Ala Tyr Lys Asn Trp
85 90 95

Glu Thr Glu Ile Thr Thr Gln Pro Asp Gly Gly Lys Ala Glu Asn Cys
100 105 110

Ala Ala Leu Ser Gly Ala Ala Asn Gly Lys Trp Phe Asp Lys Arg Cys
115 120 125

Arg Asp Gln Leu Pro Tyr Ile Cys Gln Phe Ala Ile Val Ala
130 135 140



<210> 39
<211> 116
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide
<400> 39

cgcctacaag aactggnnsn nsnnsnnsnn snnscaaccc gatnnsnnsn nsnnsgagaa 60
ctgcgcggtc ctgtcaggcg cggccaacgg caagtggnns gacaagcgct gccgcg 116
<210> 40
<211> 31
<212> DNA
<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 40 

gaccggtacc cgcatcgcct acaagaactg g 31 



<210> 41 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 41 

gtagggcaat tgatcgcggc agcgcttgtc 30 



<210> 42 
<211> 94 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 42 

gctgggcctc aacgacnnsn nsnnsgagnn snnstgggtg gacatgaccg gtacccgcat 60 
cgcctacaag aactgggaga ctgagatcac cgcg 94 



<210> 43 
<211> 102 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 43 

cgcggcagcg cttgtcgaac cacttgccgt tggccgcgcc tgacaggacc gcgcagttct 60 
csnnsnnsnn snnatcgggt tgcgcggtga tctcagtctc cc 102



<210> 44
<211> 31
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide

<400> 44 

cgaggccgag atctggctgg gcctcaacga c 31 



<210> 45 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 45 

gggcaacgag gccgagatct ggctgggcct c 31 



<210> 46 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 46 

cctgaccctg cagcgcttg 19 



<210> 47 
<211> 81 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 47 

cgagatctgg ctgggcctca acgacnnsnn snnsnnsnns nnsgagggca cctgggtgga 60 
catgaccggt acccgcatcg c 81



<210> 48 
<211> 78 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 48 

cgagatctgg ctgggcctca acgacnnsnn snnsnnsnns gagggcacct gggtggacat 60 
gaccggtacc cgcatcgc 78 
<210> 49
<211> 94 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 49 

gctgggcctc aacgacnnsn nsnnsgagnn snnstgggtg gacatgaccg gtacccgcat 60 
cgcctacaag aactgggaga ctgagatcac cgcg 94 

<210> 50 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 50 

gcgatgcggg taccggtc 18 



<210> 51 
<211> 89 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 51 

gcatcgccta caagaactgg gagactgaga tcaccgcgca acccgatggc ggcnnsnnsn 60 
nsnnsnnsnn sgagaactgc gcggtcctg 89 

<210> 52 
<211> 86 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 52 

gcatcgccta caagaactgg gagactgaga tcaccgcgca acccgatggc ggcnnsnnsn 60 
nsnnsnnsga gaactgcgcg gtcctg 86 

<210> 53 

<211> 34 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> Description of Artificial Sequence:
oligonucleotide 

<400> 53 

catgaccggt acccgcatcg cctacaagaa ctgg 34 

<210> 54 
<211> 66 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 54 

cctgaccctg cagcgcttgt cgaaccactt gccgttggcc gcgcctgaca ggaccgcgca 60 



<210> 55 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 55 

ggtacctaag tgacgatatc ctgacctaac tgcagggatc aattg 45 

<210> 56
<211> 343
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (8)..(274)
<223> Human PhtCPB insert
<400> 56

ggcccag ccg gcc atg gcc gcc ctc cag acg gtc tgc ctg aag ggg acc 49
Pro Ala Met Ala Ala Leu Gln Thr Val Cys Leu Lys Gly Thr
1 5 10

aag gtg cac atg aaa tgc ttt ctg gcc ttc acc cag acg aag acc ttc 97
Lys Val His Met Lys Cys Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe
15 20 25 30

cac gag gcc agc gag gac tgc atc tcg cgc ggg ggc acc ctg agc acc 145
His Glu Ala Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr
35 40 45

cct cag act ggc tcg gag aac gac gcc ctg tat gag tac ctg cgc cag 193
Pro Gln Thr Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln
50 55 60

agc gtg ggc aac gag gcc gag atc tgg ctg ggc ctc aac gac atg gcg 241
Ser Val Gly Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala
65 70 75

gcc gag ggc acc tgg gtg gac atg acc ggt acc taagtgacga tatcctgacc 294
Ala Glu Gly Thr Trp Val Asp Met Thr Gly Thr
80 85

taactgcagg gatcaattgc cctacatctg ccagttcggg atcgtgtag 343



<210> 57
<211> 89
<212> PRT
<213> Homo sapiens
<400> 57

Pro Ala Met Ala Ala Leu Gln Thr Val Cys Leu Lys Gly Thr Lys Val
1 5 10 15

His Met Lys Cys Phe Leu Ala Phe Thr Gln Thr Lys Thr Phe His Glu
20 25 30

Ala Ser Glu Asp Cys Ile Ser Arg Gly Gly Thr Leu Ser Thr Pro Gln
35 40 45

Thr Gly Ser Glu Asn Asp Ala Leu Tyr Glu Tyr Leu Arg Gln Ser Val
50 55 60

Gly Asn Glu Ala Glu Ile Trp Leu Gly Leu Asn Asp Met Ala Ala Glu
65 70 75 80

Gly Thr Trp Val Asp Met Thr Gly Thr
85



<210> 58
<211> 405
<212> DNA
<213> Rattus rattus
<220>
<221> CDS
<222> (8)..(400)
<223> Rat PrMBP insert
<400> 58

ggcccag ccg gcc atg gcc aac aag ttg cat gcc ttc tcc atg ggt aaa 49
Pro Ala Met Ala Asn Lys Leu His Ala Phe Ser Met Gly Lys
1 5 10

aag tct ggg aag aag ttc ttt gtg acc aac cat gaa agg atg ccc ttt 97
Lys Ser Gly Lys Lys Phe Phe Val Thr Asn His Glu Arg Met Pro Phe
15 20 25 30

tcc aaa gtc aag gcc ctg tgc tca gag ctc cga ggc act gtg gct atc 145
Ser Lys Val Lys Ala Leu Cys Ser Glu Leu Arg Gly Thr Val Ala Ile
35 40 45

ccc aag aat gct gag gag aac aag gcc atc caa gaa gtg gct aaa acc 193
Pro Lys Asn Ala Glu Glu Asn Lys Ala Ile Gln Glu Val Ala Lys Thr
50 55 60

tct gcc ttc cta ggc atc acg gac gag gtg act gaa ggc caa ttc atg 241
Ser Ala Phe Leu Gly Ile Thr Asp Glu Val Thr Glu Gly Gln Phe Met
65 70 75

tat gtg aca ggg ggg agg ctc acc tac agc aac tgg aaa aag gat gag 289
Tyr Val Thr Gly Gly Arg Leu Thr Tyr Ser Asn Trp Lys Lys Asp Glu
80 85 90

ccc aat gac cat ggc tct ggg gaa gac tgt gtc act ata gta gac aac 337
Pro Asn Asp His Gly Ser Gly Glu Asp Cys Val Thr Ile Val Asp Asn
95 100 105 110

ggt ctg tgg aat gac atc tcc tgc caa gct tcc cac acg gct gtc tgc 385
Gly Leu Trp Asn Asp Ile Ser Cys Gln Ala Ser His Thr Ala Val Cys
115 120 125

gag ttc cca gcc gcg gccgc 405
Glu Phe Pro Ala Ala
130



<210> 59
<211> 131
<212> PRT
<213> Rattus rattus
<400> 59

Pro Ala Met Ala Asn Lys Leu His Ala Phe Ser Met Gly Lys Lys Ser
1 5 10 15

Gly Lys Lys Phe Phe Val Thr Asn His Glu Arg Met Pro Phe Ser Lys
20 25 30

Val Lys Ala Leu Cys Ser Glu Leu Arg Gly Thr Val Ala Ile Pro Lys
35 40 45

Asn Ala Glu Glu Asn Lys Ala Ile Gln Glu Val Ala Lys Thr Ser Ala
50 55 60

Phe Leu Gly Ile Thr Asp Glu Val Thr Glu Gly Gln Phe Met Tyr Val
65 70 75 80

Thr Gly Gly Arg Leu Thr Tyr Ser Asn Trp Lys Lys Asp Glu Pro Asn
85 90 95

Asp His Gly Ser Gly Glu Asp Cys Val Thr Ile Val Asp Asn Gly Leu
100 105 110

Trp Asn Asp Ile Ser Cys Gln Ala Ser His Thr Ala Val Cys Glu Phe
115 120 125

Pro Ala Ala
130
<210> 60
<211> 408
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (8)..(403)
<223> Human PhSP-D insert
<400> 60

ggcccag ccg gcc atg gcc aag aaa gtt gag ctc ttc cca aat ggc caa 49
Pro Ala Met Ala Lys Lys Val Glu Leu Phe Pro Asn Gly Gln
1 5 10

agt gtg ggg gag aag att ttc aag ac^ gca ggc ttt gta aaa cca ttt 97
Ser Val Gly Glu Lys Ile Phe Lys Thr Ala Gly Phe Val Lys Pro Phe
15 20 25 30

acg gag gca cag ctg ctg tgc aca cag gct ggt gga cag ttg gcc tct 145
Thr Glu Ala Gln Leu Leu Cys Thr Gln Ala Gly Gly Gln Leu Ala Ser
35 40 45

cca cgc tct gcc gct gag aat gcc gcc ttg caa cag ctg gtc gta gct 193
Pro Arg Ser Ala Ala Glu Asn Ala Ala Leu Gln Gln Leu Val Val Ala
50 55 60

aag aac gag gct gct ttc ctg agc atg act gat tcc aag aca gag ggc 241
Lys Asn Glu Ala Ala Phe Leu Ser Met Thr Asp Ser Lys Thr Glu Gly
65 70 75

aag ttc acc tac ccc aca gga gag tcc ctg gtc tat tcc aac tgg gcc 289
Lys Phe Thr Tyr Pro Thr Gly Glu Ser Leu Val Tyr Ser Asn Trp Ala
80 85 90

cca ggg gag ccc aac gat gat ggc ggg tca gag gac tgt gtg gag atc 337
Pro Gly Glu Pro Asn Asp Asp Gly Gly Ser Glu Asp Cys Val Glu Ile
95 100 105 110

ttc acc aat ggc aag tgg aat gac agg gct tgt gga gaa aag cgt ctt 385
Phe Thr Asn Gly Lys Trp Asn Asp Arg Ala Cys Gly Glu Lys Arg Leu
115 120 125

gtg gtc tgc gag ttc gcg gccgc 408
Val Val Cys Glu Phe Ala
130



<210> 61
<211> 132
<212> PRT
<213> Homo sapiens
<400> 61

Pro Ala Met Ala Lys Lys Val Glu Leu Phe Pro Asn Gly Gln Ser Val
1 5 10 15

Gly Glu Lys Ile Phe Lys Thr Ala Gly Phe Val Lys Pro Phe Thr Glu
20 25 30

Ala Gln Leu Leu Cys Thr Gln Ala Gly Gly Gln Leu Ala Ser Pro Arg
35 40 45

Ser Ala Ala Glu Asn Ala Ala Leu Gln Gln Leu Val Val Ala Lys Asn
50 55 60

Glu Ala Ala Phe Leu Ser Met Thr Asp Ser Lys Thr Glu Gly Lys Phe
65 70 75 80

Thr Tyr Pro Thr Gly Glu Ser Leu Val Tyr Ser Asn Trp Ala Pro Gly
85 90 95

Glu Pro Asn Asp Asp Gly Gly Ser Glu Asp Cys Val Glu Ile Phe Thr
100 105 110

Asn Gly Lys Trp Asn Asp Arg Ala Cys Gly Glu Lys Arg Leu Val Val
115 120 125

Cys Glu Phe Ala
130

<210> 62
<211> 49
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide
<400> 62

cggctgagcg gcccagccgg ccatggccaa caagttgcat gccttctcc 49

<210> 63
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide
<400> 63

gcactcctgc ggccgcggct gggaactcgc agac 34

<210> 64
<211> 48
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide
<400> 64

cggctgagcg gcccagccgg ccatggccaa gaaagttgag ctcttccc 48
<210> 65
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide
<400> 65

gcactcctgc ggccgcgaac tcgcagacca caagac 36



<210> 66 
<211> 65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 66 

gccaccggtg acgtagatga attggccttc snnsnnsnns nnsnngtccg tgatgcctag 60 



<210> 67 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 67 

gccaccggtg acgtagatga attggccttc snnsnnsnns nnsnnsnngt ccgtgatgcc 60
taggaagg 68 



<210> 68 
<211> 62 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 68 

gccaccggtg acgtagatga asnnsnnsnn snnsnnsnns nncgtgatgc ctaggaaggc 60 
ag 62



<210> 69
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide

<400> 69 

ccagttgctg tatttcaggc tgccaccggt gacgtagatg 40 



<210> 70
<211> 34
<212> DNA
<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 70 

gcctgaaata cagcaactgg aagaaagacg aacc 34 



<210> 71 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 71 

ctggaagaaa gacgaaccga atgaccatgg cnnsnnsnns nnsnnsgaag actgtgtcac 60 
tatagtag 68 



<210> 72 
<211> 71 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 72 

ctggaagaaa gacgaaccga atgaccatgg cnnsnnsnns nnsnnsnnsg aagactgtgt 60 
cactatagta g 71 



<210> 73 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 73 

ctggaagaaa gacgaaccga atnnsnnsnn snnsnnsgaa gactgtgtca ctatagtag 59 
<210> 74
<211> 17
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide
<400> 74

cggctgagcg gcccagc 17



<210> 75 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 75 

gcactcctgc ggccgcg 17



<210> 76
<211> 69
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide
<400> 76

ctcaccggtc ggatacgtga acttgccctc tgtsnnsnns nnsnnsnnat cagtcatgct 60
caggaaagc 69



<210> 77
<211> 72
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide
<400> 77

ctcaccggtc ggatacgtga acttgccctc tgtsnnsnns nnsnnsnnsn natcagtcat 60
gctcaggaaa gc 72



<210> 78
<211> 60
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide
<400> 78

ctcaccggtc ggatacgtga asnnsnnsnn snnsnnsnns nnagtcatgc tcaggaaagc 60



<210> 79 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 79 

cagttggaat agaccaggga ctcaccggtc ggatacgtg 39 



<210> 80 
<211> 65 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 80 

gggccccagg ggagcccaac gatgatggcn nsnnsnnsnn snnsgaggac tgtgtggaga 60 
tcttc 65 



<210> 81 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 81 

gggccccagg ggagcccaac gatgatggcn nsnnsnnsnn snnsnnsgag gactgtgtgg 60 
agatcttc 68 



<210> 82 
<211> 68 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 



<400> 82 

gggccccagg ggagcccaac gatgatggcn nsnnsnnsnn snnsnnsgag gactgtgtgg 60 
agatcttc 68 
<210> 83
<211> 56
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence:
oligonucleotide
<400> 83

gggccccagg ggagcccaac nnsnnsnnsn nsnnsgagga ctgtgtggag atcttc 56



<210> 84 
<211> 77 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 84 

gcatcgccta caagaactgg nnsnnsnnsn nsnnsnnsca acccgatggc ggcaagaccg 60
agaactgcgc ggtcctg 77



<210> 85 
<211> 83 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 



<400> 85 

gcatcgccta caagaactgg gagnnsnnsn nsnnsnnsnn sgcgcaaccc gatggcggca 60
agaccgagaa ctgcgcggtc ctg 83



<210> 86 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 86 

gcatcgccta caagaactgg gagnnsnnsn nsnnsnnsgc gcaacccgat ggcggcaaga 60
ccgagaactg cgcggtcctg 80

<210> 87 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 






<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 87

gtagggcaat tgatcgctgc agcgcttgtc gaaccasnns nnsnnsnnsn nsnnsnncag 60 
gaccgcgcag ttctc 75 

<210> 88 
<211> 84 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 88 

gtagggcaat tgatcgctgc agcgcttgtc gaaccacttg ccsnnsnnsn nsnnsnnsnn 60 
gcctgacagg accgcgcagt tctc 84

<210> 89 
<211> 81 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 89 

gtagggcaat tgatcgctgc agcgcttgtc gaaccacttg ccsnnsnnsn nsnnsnngcc 60 
tgacaggacc gcgcagttct c 81 

<210> 90 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 90 

gtagggcaat tgatcgctgc 20 

<210> 91 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 



<400> 91

catgaccggt acccgcatcg cctacaagaa ctgg 34
(57) Abstract: A novel family of protein libraries comprising CTLDs (C-type Lectin-Like Domains) in which internal polypeptide
loop-regions lining the ligand binding sites in CTLDs have been replaced with ensembles of completely or partially randomised
polypeptide segments. Tetranectin CTLDs were chosen as framework for the preferred embodiment of the invention; and versatile
phagemid vectors useful in the generation and manipulation of human and murine tetranectin CTLD libraries are disclosed as part
of this invention. Tetranectin CTLDs in monomeric as well as in trimeric form are efficiently displayed as gene III fusions in fully
functional form by the recombinant fd phage display vector. CTLD derivatives with affinity for new ligands may readily be isolated
from libraries of vectors displaying CTLDs, in which loop-regions have been randomised, using one or more rounds of enrichment
by screening or selection followed by amplification of the enriched subpopulation in each round. The efficiency with which protein
products containing CTLDs with new binding properties can be produced, e.g. by bacterial expression and in vitro refolding, in mono-,
tri-, or multimeric formats provides important advantages in terms of simplicity, cost and efficiency of generation, production and
diagnostic or therapeutic applications in comparison to recombinant antibody derivatives.
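
As a rough illustration of what "completely randomised" loop segments imply for library size, the following sketch expands the degenerate `nns` codon that appears in several oligonucleotides of the sequence listing (for example SEQ ID NO: 48) and counts the theoretical diversity. It assumes the standard IUPAC nucleotide codes (n = a/c/g/t, s = c/g) and the standard genetic code. The function names are hypothetical, and the resulting numbers are theoretical upper bounds, not library sizes reported in this document.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# IUPAC codes used in the listing's oligonucleotides (assumption: only these occur).
IUPAC = {"a": "A", "c": "C", "g": "G", "t": "T", "n": "ACGT", "s": "CG"}


def expand_codon(codon):
    """Return every concrete codon matched by a degenerate codon such as 'nns'."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in codon.lower()))]


def summarize_nns():
    """Count codons, encoded amino acids and stop codons for the 'nns' scheme."""
    codons = expand_codon("nns")
    amino_acids = {CODON_TABLE[c] for c in codons} - {"*"}
    stops = sorted(c for c in codons if CODON_TABLE[c] == "*")
    return len(codons), len(amino_acids), stops


def theoretical_diversity(oligo):
    """Return (randomised codons, DNA variants, stop-free protein variants)."""
    seq = oligo.replace(" ", "").lower()
    k = seq.count("nns")  # simple non-overlapping count, adequate for this sketch
    n_codons, n_aa, _ = summarize_nns()
    return k, n_codons ** k, n_aa ** k


# SEQ ID NO: 48 as given in the sequence listing (78 nucleotides).
SEQ_48 = ("cgagatctgg ctgggcctca acgacnnsnn snnsnnsnns gagggcacct gggtggacat "
          "gaccggtacc cgcatcgc")

if __name__ == "__main__":
    print(summarize_nns())
    print(theoretical_diversity(SEQ_48))
```

Under these assumptions each `nns` position covers 32 codons that encode all 20 amino acids plus a single stop codon (TAG), so an oligonucleotide with five such positions corresponds to at most 32^5 DNA variants and 20^5 full-length protein variants.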